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METHODS FOR DIAGNOSING AND MONITORING MALIGNANCIES BY 
5 SCREENING GENE COPY NUMBERS 



FIELD OF THE INVENTION 

This invention pertains to the field of cancer genetics and cytogenetics. In 
particular, this invention pertains to the identification of an association between 
10 amplification(s) deletion(s) of particular genes and cancer, 

BACKGROUND OF THE INVENTION 

Chromosome abnormalities are often associated with genetic disorders, 
degenerative diseases, and cancer. The deletion or multiplication of copies of whole 
chromosomes and the deletion or amplifications of chromosomal segments or specific 

15 regions are common occurrences in cancer (Smith (1991) Breast Cancer Res. Treat. 18: 
Suppl. 1:5-14; van deVijer (1991) Biochim. Biophys. Acta. 1072:33-50). In fact, 
amplifications and deletions of DNA sequences can be the cause of a cancer. For example, 
proto-oncogenes and tumor-suppressor genes, respectively, are frequently characteristic of 
tumorigenesis (Dutrillaux (1990) Cancer Genet Cytogenet. 49: 203-217), Clearly, the 

20 identification and cloning of specific genomic regions associated with cancer is crucial both 
to the study of tumorigenesis and in developing better means of diagnosis and prognosis. 

Studies using comparative genomic hybridization (CGH) have revealed 
approximately twenty amplified genomic regions in human breast tumors (Muleris (1994) 
Genes Chromosomes Cancer 10:160-170; Kalliioniemi (1994) Proc. Natl. Acad. Set USA 

25 91 : 2156-2160; Isola (1995) Am. J. Pathol 147:905-91 1). These regions are predicted to 
encode dominantly acting genes that may play a role in tumor progression or response to 
therapy. Three of these amplified regions have been associated with established oncogenes: 
ERBB2 at 17ql2, MYC at 8q24 and CCND1 and EMS1 at 1 lql3. In breast cancer, ERBB2 
and CCND1/EMS1 amplification and overexpression are associated with decreased life 

30 expectancy (Gaudray (1992) Mutat. Res. 276:317-328; Borg (1991) Oncogene 6:137-143). 
MYC amplification has been associated with lymph node involvement, advanced stage 

2 



cancer and an increased rate of relapse (Borg (1992) Intern. J. Cancer 51: 687-691; Berns 
(1995) Gene 159: 11-18). Clearly, the identification of additional amplified genomic regions 
associated with breast cancer or other tumor cells is critical to the study of tumorigenesis and 
in the development of cancer diagnostics. 
5 One of the amplified regions found in the CGH studies was on chromosome 

20, specifically, 20ql3. Amplification of 20ql3 was subsequently found to occur in a 
variety of tumor types and to be associated with aggressive tumor behavior. Increased 
20ql3 copy number was found in 40% of breast cancer cell lines and 18% of primary breast 
tumors (Kalliioniemi (1994) supra). Copy number gains at 20ql3 have also been reported in 

10 greater than 25% of cancers of the ovary (Iwabuchi (1995) Cancer Res. 55:6172-6180), 
colon (Schlegel (1995) Cancer Res. 55: 6002-6005), head-and-neck (Bockmuhl (1996) 
Laryngor. 75: 408-414), brain (Mohapatra (1995) Genes Chromosomes Cancer 13: 86-93), 
and pancreas (Solinas-Toldo (1996) Genes Chromosomes Cancer 20:399-407). 

The 20ql3 region was analyzed at higher resolution in breast tumors and cell 

15 lines using fluorescent in situ hybridization (FISH). A 1 .5 megabase (Mb) wide amplified 
region within 20ql3 was identified (Stokke (1995) Genomics 26: 134-137); Tanner (1994) 
Cancer Res. 54:4257- 4260). Interphase FISH revealed low-level (>1.5X) and high level 
(>3X) 20ql3 sequence amplification in 29% and 7% of breast cancers, respectively (Tanner 
(1995) Clin. Cancer Res. 1: 1455-1461). High level amplification was associated with an 

20 aggressive tumor phenotype (Tanner (1995) supra; Courjal (1996) Br. J. Cancer 74: 1984), 
Another study, using FISH to analyze 14 loci along chromosome 20q in 146 uncultured 
breast carcinomas, identified three independently amplified regions, including RMC20C001 
region at 20ql3.2 (highly amplified in 9.6% of the cases), PTPN1 region 3 Mb proximal 
(6.2%), and AIB3 region at 20ql 1 (6.2%) (Tanner (1996) Cancer Res. 56:3441-3445). 

25 Clearly, definitive characterization of amplified regions within 20ql3 and other loci would 
be an important step in the diagnosis and prognosis of these cancers. 

Increased copy number of chromosome 20q in cultured cells also has been 
associated with phenotypes characteristic of progressing tumors, including immortalization 
and genomic instability. For example, increased copy number at 20ql 1-qter has been 

30 observed frequently in human uro-epithelial cells (HUC) (Reznikoff (1994) Genes Dev. 8: 
2227-2240) and keratinocytes (Solinas-Toldo (1997) Proc. Natl. Acad. Set USA 94:3854- 
3859) after transfection with human papilloma virus (HPV)16 E7 or HPV16, respectively. 
In addition, increased copy number at 20ql3.2 has been associated with p53 independent 



genomic instability in some HPV16 E7 transfected HUC lines (Savelieva (1997) Oncogene 
14: 551-560). These studies suggest that increased expression of one or more genes on 20q 
and especially at 20ql3.2 contribute to the evolution of breast cancer and other solid tumors. 
Several candidate oncogenes have been identified as amplified on 20q, including AIB1 
5 (Anzick (1997) Science 277: 965-968), BTAK (Sen (1997) Oncogene 14: 2195-200), CAS 
(Brinkmann (1996) Genome Res. 6: 1 87-194) and TFAP2C (Williamson (1996) Genomics 
35:262-264). Clearly, definitive characterization of nucleic acid sequences in 20ql3 
associated with tumor phenotypes would be an important step in the diagnosis and prognosis 
of these cancers. The present invention fulfills these and other needs. 

10 SUMMARY OF THE INVENTION 

This invention pertains to the discovery that an amplification of the genes of 
Figure 1 or an increase in gene product activity is a marker for the presence of, progression 
of, or predisposition to, a cancer (e.g., breast cancer). Using this information, this invention 
provides methods of detecting/evaluating a predisposition to, progression of, or prognosis of 

1 5 cancer in an animal. Thus, in one embodiment, this invention provides methods of detecting 
a predisposition to cancer in an animal The methods involve providing a biological sample 
from said animal, detecting the level of a gene of Figure 1 or a gene product encoded thereby 
within the biological sample; and comparing the level of a gene of Figure 1 or a gene 
product encoded thereby with a level of a gene of Figure 1 or a gene product encoded 

20 thereby in a control sample taken from a normal, cancer-free tissue; 

where an increased level of a gene of Figure 1 or a gene product encoded 
thereby in the biological sample compared to the level of a gene of Figure 1 or a gene 
product encoded thereby in the control sample indicates the presence of a cancer in the 
animal. Similarly, an increased level of a gene of Figure 1 or a gene product encoded 

25 thereby in the sample can indicate a poor prognosis for an animal/patient known to have 
cancer, and/or a reduced survival expectancy, and/or the actual presence of a cancer. 

This invention also pertains to the discovery that a deletion of the genes of 
Figure 2 or a decrease in gene product activity is a marker for the presence of, progression 
of, or predisposition to, a cancer (e.g., breast cancer). Using this information, this invention 

30 provides methods of detecting/evaluating a predisposition to, progression of, or prognosis of 
cancer in an animal. Thus, in one embodiment, this invention provides methods of detecting 
a predisposition to cancer in an animal. The methods involve providing a biological sample 



from said animal, detecting the level of a gene of Figure 2 or a gene product encoded thereby 
within the biological sample; and comparing the level of a gene of Figure 2 or a gene 
product encoded thereby with a level of a gene of Figure 2 or a gene product encoded 
thereby in a control sample taken from a normal, cancer-free tissue; 
5 where a decreased level of a gene of Figure 2 or a gene product encoded 

thereby in the biological sample compared to the level of a gene of Figure 2 or a gene 
product encoded thereby in the control sample indicates the presence of a cancer in the 
animal. Similarly, a decreased level of a gene of Figure 2 or a gene product encoded thereby 
in the sample can indicate a poor prognosis for an animal/patient known to have cancer, 

1 0 and/or a reduced survival expectancy, and/or the actual presence of a cancer. 

In one embodiment, the level of the subject gene of Figure 1 or 2 is detected 
by determining the copy number of the genes of Figure 1 or 2 in the cells of the biological 
sample. In a particularly preferred embodiment, the copy number is measured using 
Comparative Genomic Hybridization (CGH). In another preferred embodiment, the copy 

1 5 number is determined by hybridization to an array of nucleic acid probes and in another 

particularly preferred embodiment, the Comparative Genomic Hybridization is performed on 
an array. 

In another embodiment, the level of the subject gene of Figure 1 or 2 is 
detected by measuring the level of transcribed mRNA in the biological sample (e.g., by 
20 hyridization to one or more probes in an array), wherein an increased level of RNA in said 
sample compared to RNA in said control sample indicates a predisposition to cancer. 

In still another embodiment, the level of the subject gene is detected by 
measuring the level of the gene product protein in the biological sample, where an increased 
level of the gene product protein in the sample as compared to gene product protein in said 
25 control sample indicates a predisposition to cancer. 

In the methods described herein, the animal(s) are mammals, more preferably 
mammals selected from the group of humans, non-human primates, canines, felines, 
murines, bovines, equines, porcines, and lagomorphs. 

Preferred biological samples are selected from the group consisting of excised 
30 tissue (e.g. , tissue biopsy), whole blood, serum, plasma, buccal scrape, saliva, cerebrospinal 
fluid, and urine. 

In preferred embodiments, the difference between the increased level of gene 
expression in the biological sample and the level of gene expression in said control sample is 



a statistically significant difference (e.g. the increased level of gene expression in the 
biological sample is at least about 2-fold greater, more preferably at least 4-fold greater than 
the level of gene expression in the control sample). 

This invention also provides methods of treating cancer in an animaL The 
5 methods involve performing the assays as described herein (e.g. providing a biological 
sample from said animal; detecting the level of gene expression within said biological 
sample; and comparing said level of gene expression with a level of gene expression in a 
control sample from a normal, cancer- free tissue) and selecting and performing a cancer 
therapy in those animals having an increased level of gene expression compared to the level 

1 0 of gene expression in said control sample. In preferred embodiments, the cancer therapy is 
selected from the group consisting of chemotherapy, radiation therapy, surgery, antihormone 
therapy, and immunotherapy. In some preferred embodiments, the cancer therapy is an 
adjuvant cancer therapy. 

This invention also provides methods of screening a test agent for the ability 

1 5 to inhibit proliferation of a cell expression a gene of Figure 1. The methods involve 

contacting the cell with said test agent; and detecting the level of activity of the gene of 
Figure 1, where a decreased level of activity as compared to the level of activity in a cell not 
contacted with the agent indicates that the agent inhibits proliferation of said cell. The 
detection of gene expression level can be as described herein. In some embodiments the cell 

20 is a tumor cell. In some embodiments, the cell is a hyperproliferative cell. In particularly 
preferred embodiments, the difference between said decreased level of expression of a gene 
of Figure 1 and the level of expression of a gene of Figure 1 in a cell not contacted with said 
agent is a statistically significant difference (e.g. at least 2-fold lower, more preferably at 
least 4-fold lower in the cell contacted with the test agent). 

25 This invention additionally provides methods of decreasing the proliferation 

of a cell with an elevated level of expression of a gene of Figure 1 . The methods involve 
reducing the level of gene activity in the cell using an inhibitor of gene expression. The cell 
can be a tumor cell (e.g., breast cancer cell, prostate cancer cell, colorectal cancer cell, 
leukemia cell, lymphoma, lung cancer cell, brain cancer cell, pancreatic cancer cell, colon 

30 cancer cell, and ovarian cancer cell). The cell can be a hyperproliferative cell. The cell can 
also be a metastatic cell. Preferred inhibitors include antisense oligonucleotides, ribozymes, 
repressors of gene expression, etc. 
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DEFINITIONS 

To facilitate understanding the invention, a number of terms are defined 

below. 

A "gene of Figure 1 or Figure 2" is a DNA sequence that encodes a gene 
5 product. The DNA sequence has the Gen Bank Accession Numbers set forth. The term 
gene can refer to a mutated copy of the gene, or a fragment of the gene. 

The terms "hybridizing specifically to" and "specific hybridization" and 
"selectively hybridize to," as used herein refer to the binding, duplexing, or hybridizing of a 
nucleic acid molecule preferentially to a particular nucleotide sequence under stringent 

10 conditions. The term "stringent conditions" refers to conditions under which a probe will 
hybridize preferentially to its target subsequence, and to a lesser extent to, or not at all to, 
other sequences. A "stringent hybridization" and "stringent hybridization wash conditions" 
in the context of nucleic acid hybridization (e.g., as in array, Southern or Northern 
hybridizations) are sequence dependent, and are different under different environmental 

1 5 parameters. An extensive guide to the hybridization of nucleic acids is found in, e.g., 
Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology- 
Hybridization with Nucleic Acid Probes part I, chapt 2, "Overview of principles of 
hybridization and the strategy of nucleic acid probe assays" Elsevier, NY ("Tijssen"). 
Generally, highly stringent hybridization mid wash conditions are selected to be about 5°C 

20 lower than the thermal melting point (T m ) for the specific sequence at a defined ionic 

strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 
50% of the target sequence hybridizes to a perfectly matched probe. Very stringent 
conditions are selected to be equal to the T m for a particular probe. An example of stringent 
hybridization conditions for hybridization of complementary nucleic acids which have more 

25 than 1 00 complementary residues on an array or on a filter in a Southern or northern blot is 
42°C using standard hybridization solutions (see, e.g., Sambrook (1989) Molecular Cloning; 
A Laboratory Manual (2nd ed.) Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring 
Harbor Press, NY, and detailed discussion, below), with the hybridization being carried out 
overnight. An example of highly stringent wash conditions is 0.15 M NaCl at 72°C for 

30 about 15 minutes. An example of stringent wash conditions is a 0.2x SSC wash at 65°C for 
15 minutes (see, e.g., Sambrook supra.) for a description of SSC buffer). A typical stringent 
wash for an array hybridization is 50% formamide, 2X SSC at 50°C to 50°C. Often, a high 
stringency wash is preceded by a low stringency wash to remove background probe signal. 



An example medium stringency wash for a duplex of, e.g., more than 100 nucleotides, is lx 
SSC at 45°C for 15 minutes. An example of a low stringency wash for a duplex of, e.g., 
more than 100 nucleotides, is 4x to 6x SSC at 40°C for 15 minutes. 

The term "labeled with a detectable composition", as used herein, refers to a 
5 nucleic acid attached to a detectable composition, i.e., a label. The detection can be by, e.g., 
spectroscopic, photochemical, biochemical, immunochemical, physical or chemical means. 
For example, useful labels include 32 P, 35 S, 3 H, 14 C, 125 1, 131 I; fluorescent dyes {e.g., FITC, 
rhodamine, lanthanide phosphors, Texas red), electron-dense reagents {e.g. gold), enzymes, 
e.g., as commonly used in an ELISA {e.g., horseradish peroxidase, beta-galactosidase, 

10 luciferase, alkaline phosphatase), colorimetric labels {e.g. colloidal gold), magnetic labels 
{e.g. Dynabeads™ ), biotin, dioxigenin, or haptens and proteins for which antisera or 
monoclonal antibodies are available. The label can be directly incorporated into the nucleic 
acid, peptide or other target compound to be detected, or it can be attached to a probe or 
antibody that hybridizes or binds to the target. A peptide can be made detectable by 

1 5 incorporating predetermined polypeptide epitopes recognized by a secondary reporter {e.g., 
leucine zipper pair sequences, binding sites for secondary antibodies, transcriptional 
activator polypeptide, metal binding domains, epitope tags). Label can be attached by spacer 
arms of various lengths to reduce potential steric hindrance or impact on other useful or 
desired properties {see, e.g., Mansfield (1995) Mol Cell Probes 9: 145-156). It will be 

20 appreciated that combinations of labels can also be used. Thus, for example, in some 

embodiments, different nucleic acids may be labeled with distinguishable {e.g. differently 
colored) labels. 

The term "nucleic acid" as used herein refers to a deoxyribonucleotide or 
ribonucleotide in either single- or double-stranded form. The term encompasses nucleic 

25 acids, i.e., oligonucleotides, containing known analogues of natural nucleotides which have 
similar or improved binding properties, for the purposes desired, as the reference nucleic 
acid. The term also includes nucleic acids which are metabolized in a manner similar to 
naturally occurring nucleotides or at rates that are improved thereover for the purposes 
desired. The term also encompasses nucleic-acid-like structures with synthetic backbones. 

30 DNA backbone analogues provided by the invention include phosphodiester, 

phosphorothioate, phosphorodithioate, methylphosphonate, phosphoramidate, alkyl 
phosphotriester, sulfamate, 3'-thioacetal, methylene(methylimino), 3-N-carbamate, 
morpholino carbamate, and peptide nucleic acids (PNAs); see Oligonucleotides and 

8 



Analogues, a Practical Approach, edited by F. Eckstein, IRL Press at Oxford University 
Press (1991); Antisense Strategies, Annals of the New York Academy of Sciences, Volume 
600, Eds. Baserga and Denhardt (NY AS 1992); Milligan (1993) J. Med. Chem. 36:1923- 
1937; Antisense Research and Applications (1993, CRC Press). PNAs contain non-ionic 
5 backbones, such as N-(2-aminoethyl) glycine units. Phosphorothioate linkages are described 
in WO 97/03211; WO 96/39154; Mata (1997) Toxicol Appl Pharmacol 144:189-197. 
Other synthetic backbones encompasses by the term include methyl-phosphonate linkages or 
alternating methylphosphonate and phosphodiester linkages (Strauss-Soukup (1997) 
Biochemistry 36: 8692-8698), and benzylphosphonate linkages (Samstag (1996) Antisense 

10 Nucleic Acid DrugDev 6: 153-156). The term nucleic acid is used interchangeably with 
gene, cDNA, mRNA, oligonucleotide primer, probe and amplification product. 

The term a "nucleic acid array" as used herein is a plurality of target 
elements, each target element comprising one or more nucleic acid molecules (probes) 
immobilized on one or more solid surfaces to which sample nucleic acids can be hybridized. 

15 The nucleic acids of a target element can contain sequence(s) from specific genes or clones. 
Other target elements will contain, for instance, reference sequences. Target elements of 
various dimensions can be used in the arrays of the invention. Generally, smaller, target 
elements are preferred. Typically, a target element will be less than about 1 cm in diameter. 
Generally element sizes are from 1 \xm to about 3 mm, preferably between about 5 jam and 

20 about 1 mm. The target elements of the arrays may be arranged on the solid surface at 

different densities. The target element densities will depend upon a number of factors, such 
as the nature of the label, the solid support, and the like. One of skill will recognize that 
each target element may comprise a mixture of nucleic acids of different lengths and 
sequences. Thus, for example, a target element may contain more than one copy of a cloned 

25 piece of DNA, and each copy may be broken into fragments of different lengths. The length 
and complexity of the nucleic acid fixed onto the target element is not critical to the 
invention. One of skill can adjust these factors to provide optimum hybridization and signal 
production for a given hybridization procedure, and to provide the required resolution among 
different genes or genomic locations. In various embodiments, target element sequences 

30 will have a complexity between about 1 kb and about 1 Mb, between about 10 kb to about 
500 kb, between about 200 to about 500 kb, and from about 50 kb to about 150 kb. 

The term "probe" or a "nucleic acid probe", as used herein, is defined to be a 
collection of one or more nucleic acid fragments whose hybridization to a sample can be 



detected. The probe may be unlabeled or labeled as described below so that its binding to 
the target or sample can be detected. Particularly in the case of arrays, either probe or target 
nucleic acids may be affixed to the array. Whether the array comprises "probe" or "target" 
nucleic acids will be evident from the context. Similarly, depending on context, either the 
5 probe, the target, or both can be labeled. The probe is produced from a source of nucleic 
acids from one or more particular (preselected) portions of the genome, e.g., one or more 
clones, an isolated whole chromosome or chromosome fragment, or a collection of 
polymerase chain reaction (PCR) amplification products. The probes of the present 
invention are produced from nucleic acids found in the regions described herein. The probe 

10 or genomic nucleic acid sample may be processed in some manner, e.g., by blocking or 
removal of repetitive nucleic acids or enrichment with unique nucleic acids. The word 
"sample" may be used herein to refer not only to detected nucleic acids, but to the detectable 
nucleic acids in the form in which they are applied to the target, e.g., with the blocking 
nucleic acids, etc. The blocking nucleic acid may also be referred to separately. What 

1 5 "probe" refers to specifically is clear from the context in which the word is used. The probe 
may also be isolated nucleic acids immobilized on a solid surface (e.g., nitrocellulose, glass, 
quartz, fused silica slides), as in an array. In some embodiments, the probe may be a 
member of an array of nucleic acids as described, for instance, in WO 96/17958. Techniques 
capable of producing high density arrays can also be used for this purpose {see, e.g., Fodor 

20 (1991) Science 767-773; Johnston (1998) Cum Biol. 8: R171-R174; Schummer (1997) 
Biotechniques 23: 1087-1092; Kern (1997) Biotechniques 23: 120-124; U.S. Patent No. 
5,143,854). One of skill will recognize that the precise sequence of the particular probes 
described herein can be modified to a certain degree to produce probes that are "substantially 
identical" to the disclosed probes, but retain the ability to specifically bind to (i.e., hybridize 

25 specifically to) the same targets or samples as the probe from which they were derived (see 
discussion above). Such modifications are specifically covered by reference to the 
individual probes described herein. 

The term "sample of human nucleic acid" as used herein refers to a sample 
comprising human DNA or RNA in a form suitable for detection by hybridization or 

30 amplification. The nucleic acid may be isolated, cloned or amplified; it may be, e.g., 

genomic DNA, mRNA, or cDNA from a particular chromosome, or selected sequences (e.g. 
particular promoters, genes, amplification or restriction fragments, cDNA, etc.) within 
particular amplicons or deletions disclosed here. The nucleic acid sample may be extracted 

10 



from particular cells or tissues. The cell or tissue sample from which the nucleic acid sample 
is prepared is typically taken from a patient suspected of having cancer associated with the 
amplicon amplification or deletion or translocation being detected. Methods of isolating cell 
and tissue samples are well known to those of skill in the art and include, but are not limited 
5 to, aspirations, tissue sections, needle biopsies, and the like. Frequently the sample will be a 
"clinical sample" which is a sample derived from a patient, including sections of tissues such 
as frozen sections or paraffin sections taken for histological purposes. The sample can also 
be derived from supernatants (of cells) or the cells themselves from cell cultures, cells from 
tissue culture and other media in which it may be desirable to detect chromosomal 

10 abnormalities or determine amplicon copy number. In some cases, the nucleic acids may be 
amplified using standard techniques such as PCR, prior to the hybridization. The sample 
may be isolated nucleic acids immobilized on a solid. In one embodiment, the sample may 
be prepared such that individual nucleic acids remain substantially intact and typically 
comprises interphase nuclei prepared according to standard techniques. 

1 5 The phrase "detecting a cancer" refers to the ascertainment of the presence or 

absence of cancer in an animal. "Detecting a cancer" can also refer to obtaining indirect 
evidence regarding the likelihood of the presence of cancerous cells in the animal or to the 
likelihood or predilection to development of a cancer. Detecting a cancer can be 
accomplished using the methods of this invention alone, or in combination with other 

20 methods or in light of other information regarding the state of health of the animal. 

A "cancer" in an animal refers to the presence of cells possessing 
characteristics typical of cancer-causing cells, such as uncontrolled proliferation, 
immortality, metastatic potential, rapid growth and proliferation rate, and certain 
characteristic morphological features. Often, cancer cells will be in the form of a tumor, but 

25 such cells may exist alone within an animal, or may be a non-tumorigenic cancer cell, such 
as a leukemia cell. Cancers include, but are not limited to breast cancer, lung cancer, 
bronchus cancer, colorectal cancer, prostate cancer, pancreas cancer, stomach cancer, 
ovarian cancer, urinary bladder cancer, brain or central nervous system cancer, peripheral 
nervous system cancer, esophageal cancer, cervical cancer, a melanoma, uterine or 

30 endometrial cancer, cancer of the oral cavity or pharynx, liver cancer, kidney cancer, testis 
cancer, biliary tract cancer, small bowel or appendix cancer, salivary gland cancer, thyroid 
gland cancer, adrenal gland cancer, osteosarcoma, and a chondrosarcoma. 



11 



An "animal" refers to a member of the kingdom Animalia, characterized by 
multicellularity, the possession of a nervous system, voluntary movement, internal digestion, 
etc. An "animal" can be a human or other mammal. Preferred animals include humans, 
non-human primates, and other mammals. Thus, it will be recognized that the methods of 
this invention contemplate veterinary applications as well as medical applications directed to 
humans. 

"Providing a biological sample" means to obtain a biological sample for use 
in the methods described in this invention. Most often, this will be done by removing a 
sample of cells from an animal, but can also be accomplished by using previously isolated 
cells {e.g. isolated by another person), or by performing the methods of the invention in vivo, 

A "biological sample" refers to a cell or population of cells or a quantity of 
tissue or fluid from an animal. Most often, the sample has been removed from an animal, 
but the term "biological sample" can also refer to cells or tissue analyzed in vivo, i.e. without 
removal from the animal. Often, a "biological sample" will contain cells from the animal, 
but the term can also refer to non-cellular biological material, such as non-cellular fractions 
of blood, saliva, or urine, that can be used to measure gene expression levels. Preferred 
biological samples include tissue biopsies, scrapes {e.g. buccal scrapes), whole blood, 
plasma, serum, urine, saliva, cell culture, or cerebrospinal fluid. 

"Detecting a level of gene expression" refers to determining the number of 
genes or the expression level of a particular gene or genes. The copy number of a gene can 
be measured in multiple ways known to those of skill in the art, including, but not limited to, 
Comparative Genomic Hybridization (CGH) and quantitative DNA amplification {e.g. 
quantitative PCR). Gene expression can be monitored in a variety of ways, including by 
detecting mRNA levels, protein levels, or protein activity, any of which can be measured 
using standard techniques. Detection can involve quantification of the level of gene 
expression {e.g. genomic DNA, cDNA, mRNA, protein, or enzyme activity), or, 
alternatively, can be a qualitative assessment of the level of gene expression, in particular in 
comparison with a control level. The type of level being detected will be clear from the 
context. 

To "compare" levels of gene expression means to detect gene expression 
levels in two samples and to determine whether the levels are equal or if one or the other is 
greater. A comparison can be done between quantified levels, allowing statistical 
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comparison between the two values, or in the absence of quantification, for example using 
qualitative methods of detection such as visual assessment by a human. 

A "control sample" refers to a sample of biological material representative of 
healthy, cancer-free animals, and/or cells or tissues.. The level of gene expression in a 
5 control sample is desirably typical of the general population of normal, cancer-free animals 
or of a particular individual at a particular time (e.g. before, during or after a treatment 
regimen), or in a particular tissue. This sample can be removed from an animal expressly for 
use in the methods described in this invention, or can be any biological material 
representative of normal, cancer- free animals, including cancer-free biological material 

10 taken from an animal with cancer elsewhere in its body. A control sample can also refer to 
an established level of gene expression, representative of the cancer- free population, that has 
been previously established based on measurements from normal, cancer-free animals. 

An "increased level of gene expression" means a level of gene expression, 
that, in comparison with a control level of gene expression, is detectably higher. The 

1 5 method of comparison can be statistical, using quantified values for the level of gene 

expression, or can be compared using non-statistical means, such as by visual assessment by 
a human. 

The "copy number of a gene" refers to the number of DNA sequences in a 
cell encoding a particular gene product. Generally, for a given gene, an animal has two 

20 copies of each gene. The copy number can be increased, however, by gene amplification or 
duplication, or reduced by deletion. 

When a level of a particular gene mRNA, protein, enzyme activity, or copy 
number is "measured," it is assessed using qualitative or quantitative methods. Preferably, 
the level is determined using quantitative means, allowing the statistical comparison of 

25 values obtained from biological samples and control values. The level can also be 

determined using qualitative methods, such as the visual analysis and comparison by a 
human of multiple visibly labeled samples, e.g. fluorescently labeled samples detected using 
a fluorescent microscope or other optical detector {e.g. image analysis system, etc.). When a 
level of a particular gene mRNA, protein, or enzyme activity is measured the measurement 

30 preferably includes a measurement of activity in a normal tissue or cell (e.g. from the same 
animal or from a different "control" animal). 
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"Tissue biopsy" refers to the removal of a biological sample for diagnostic 
analysis. In a patient with cancer, tissue may be removed from a tumor, allowing the 
analysis of cells within the tumor. 

When a quantified level of a gene falls outside of a given confidence interval 
5 for a normal level of the gene, the difference between the two levels is said to be 

"statistically significant." If a test value falls outside of a given confidence interval for a 
normal level of the gene, it is possible to calculate the probability that the test value is truly 
abnormal and does not just represent a normal deviation from the average. In the methods of 
this invention, a difference between a test sample and a control can be termed "statistically 
10 significant" when the probability of the test sample being abnormal can be any of a number 
of values, including 0.15, 0.1, 0.05, and 0.0L Numerous sources teach how to assess 
statistical significance, such as Freund, J.E. (1988) Modern elementary statistics, Prentice- 
Hall. 

The "survival expectancy" of an animal refers to a prognostic estimate of the 
15 outcome of a disease or condition. A "survival expectancy" can refer to a prediction 

regarding the severity, duration, or progress of a disease, condition, or any symptom thereof. 
"Survival expectancy" can also refer to the length of time an animal is expected to survive, 
or to the probability of the animal surviving until a certain time. 

A "method of treating cancer" refers to a procedure or course of action that is 
20 designed to reduce or eliminate the number of cancer cells in an animal, or to alleviate the 
symptoms of a cancer. "A method of treating cancer" does not necessarily mean that the 
cancer cells will in fact be eliminated, that the number of cells will in fact be reduced, or that 
the symptoms of a cancer will in fact be alleviated. Often, a method of treating cancer will 
be performed even with a low likelihood of success, but which, given the medical history 
25 and estimated survival expectancy of an animal, is deemed an overall beneficial course of 
action. 

"Reducing the level of gene activity" refers to inhibiting the gene product 
activity in the cell, or lowering the copy number of the gene, or decreasing the level of the 
gene's transcribed mRNA or translated protein in the cell. Preferably, the level of the 
30 particular gene activity is lowered to the level typical of a normal, cancer-free cell, but the 
level may be reduced to any level that is sufficient to decrease the proliferation of the cell, 
including to levels below those typical of normal cells. 
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A "tumor cell" is a cancer cell that is part of a tumor, has been isolated from a 
tumor, or which is capable of forming a tumor. Tumor cells can exist in vivo or in vitro. 

A "hyperproliferative cell" is a cell with an abnormally high rate of 
proliferation, or a cell that proliferates to an abnormally great extent, i.e. gives rise to a 
5 population of cells that increases in number over time. "Hyperproliferative cells" can exist 
in vitro or in vivo. 

An "inhibitor of gene activity" is a molecule that acts to reduce gene activity, 
as defined above. Such inhibitors can include antisense molecules or ribozymes, repressors 
of gene transcription, or competitive or non-competitive molecular inhibitors of the gene 
10 product. 

The phrase "repressor of transcription" refers to a molecule that can prevent 
the production of mRNA from a particular gene. Preferably, the molecule binds directly or 
indirectly to a regulatory element of the gene, thereby preventing the transcription of the 
gene. 

1 5 "Screening" for an inhibitor of cell proliferation or of gene activity means to 

systematically examine the ability of a population of molecules to inhibit cell proliferation or 
gene activity. Screening can be done in vitro or in vivo. The inhibitory activity of the 
screened molecules can be assessed directly, e.g. by examining gene product activity using 
standard assays, or indirectly, e.g. by monitoring a cellular consequence of gene product 

20 activity, such as cell proliferation. 

A "gene-expressing cell" is a cell that produces any amount of the protein 
product of the gene. Generally, the amount of enzyme produced by the cell will be 
detectable using standard techniques. 

A "test agent" is any molecule or non-molecular entity that is tested in a 

25 screen. The molecule may be randomly selected for inclusion in the screen, or may be 

included because of an a priori expectation that the molecule will give a positive result in 
the screen. The molecule may be directly introduced into a cell or a biochemical assay for 
the purposes of the screen, or it may comprise nucleic acids that encode a polypeptide or 
RNA that is desirably tested in the screen. Molecules introduced directly into an assay 

30 system can include any known chemical or biochemical molecule, including peptides, 
nucleic acids, carbohydrates, lipids, or any other organic or inorganic molecule. A "test 
agent" can also refer to non-molecular entities such as electromagnetic radiation or heat. 
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The "proliferation" of a cell refers to the rate at which the cell or population 
of cells grows and divides, or to the extent to which the cell or population of cells grows, 
divides, or increases in number. The "proliferation" of a cell can reflect multiple factors 
including the rate of cell growth and division and the rate of cell death. 
5 The phrase "decreasing the proliferation of a cell" means to reduce the rate or 

extent of growth or division of a cell or population of cells. Such methods can involve 
preventing cell division or cell growth, and may also include cell killing, and can be 
practiced in vivo or in vitro, 

"Gene-inhibiting activity" is the ability of a molecule to reduce or prevent the 
10 production and/or accumulation of gene product activity in a cell. The molecule can prevent 
the accumulation at any step of the pathway from the gene to enzyme activity, e.g. 
preventing transcription, reducing mRNA levels, preventing translation, or inhibiting the 
enzyme itself. 

The terms "polypeptide", "peptide" and "protein" are used interchangeably 
1 5 herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
in which one or more amino acid residue is an artificial chemical analogue of a 
corresponding naturally occurring amino acid, as well as to naturally occurring amino acid 
polymers. 

An amino acid, identified by name herein "e.g., arginine" or "arginine 
20 residue" as used herein refers to natural, synthetic, or version of the amino acids Thus, for 
example, an arginine can also include arginine analogs that offer the same or similar 
functionality as natural arginine with respect to their ability to be incorporated into a 
polypeptide, effect folding of that polypeptide and effect interactions of that polypeptide 
with other polypeptide(s). 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates the genes of the present invention that are amplified in 
some cell lines. A GenBank Accession number as well as locus identification is provided 
for each gene. 

Figure 2 illustrates the genes of the present invention that are deleted in some 
30 cell lines. A GenBank Accession number as well as locus identification is provided for each 

gene. 
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DETAILED DESCRIPTION 

This invention pertains to the discovery that amplification of one or more of 
the genes of Figure 1 and/or deletion of one or more of the genes of Figure 2 may be 
associated with cancer occurrence and progression. In view of these discoveries, the genes 
5 of Figuer 1 and 2 provide a good marker for a cancer and/or for the likelihood of 

(predilection to) development of a cancer. Thus, in one embodiment, this invention provides 
methods of detecting the presence of, or a predisposition to, cancer in an animal. The 
methods involve (i) providing a biological sample from an animal (e.g. a human patient); (ii) 
detecting the level of a gene of Figure 1 within the biological sample; and (iii) comparing the 

10 level of a gene of Figure 1 with a level of a gene of Figure 1 in a control sample taken from a 
normal, cancer- free animal where an increased level of a gene of Figure 1 in the biological 
sample compared to the level of a gene of Figure 1 in the control sample indicates the 
presence of said cancer in said animal. Where the a gene of Figure 1 transcript, translated 
polypeptide, or enzymatic activity is assayed. In another embodiment, the methods involve 

15 (i) providing a biological sample from an animal (e.g. a human patient); (ii) detecting the 
level of a gene of Figure 2 within the biological sample; and (iii) comparing the level of a 
gene of Figure 2 with a level of a gene of Figure 2 in a control sample taken from a normal, 
cancer-free animal where a decreased level of a gene of Figure 2 in the biological sample 
compared to the level of a gene of Figure 2 in the control sample indicates the presence of 

20 said cancer in said animal. Where the a gene of Figure 2 transcript, translated polypeptide, 
or enzymatic activity is assayed. 

Similarly, the detection of a gene of Figure 1 or Figure 2 level can also be 
used to estimate the survival expectancy of an animal with cancer. Because level of a gene 
of Figure lean be used to assay survival expectancy (e.g. likelihood of progression or 

25 recurrence of the disease), an assay of such gene level provides a useful component of a 

cancer therapy. Thus, in one preferred method of treating cancer, level of a gene of Figure 1 
or Figure 2 is assayed and, where the level of a gene of Figure 1 is high relative to the 
appropriate control or population standard or where the level of a gene of Figure 2 is low 
relative to the appropriate control or population standard, one or more adjuvant therapies 

30 (e.g. radiation therapy, resurgery, chemotherapy, etc.) are selected for the cancer treatment 
regimen. 

Having identified elevated gene levels as indicative of a cancer or a 

predisposition to cancer, gene level provides a useful target/marker for evaluating potential 
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prophylaxis and/or therapeutics. Thus, for example, the level of gene activity in the presence 
or absence of one or more putative potential therapeutics or prophylactics provides a 
measure of the potential activity of the therapeutic/prophylactic compound, i.e., a lower gene 
activity in the presence of the compound indicates higher potential activity of the compound. 
5 In another embodiment this invention provides a method of decreasing the 

proliferation of a cell {e.g. a cancer cell). The method involves reducing the level of gene 
activity in said cell using an inhibitor of the gene. 

I, Assays 

As indicated above, assays of gene copy number or level of activity provide a 
10 measure of the presence or likelihood of (predisposition to) a cancer. The sequence of the 
genes of Figures 1 and 2 are known and hence, copy number can be directly measured 
according to a number of different methods as described below. 



A) Detection of copy number 

15 In one embodiment, the presence of, or predilection to cancer, is evaluated 

simply by a determination of gene copy number. Methods of evaluating the copy number of 
a particular gene are well known to those of skill in the art. 

1) Hvbridization-based assays 

One method for evaluating the copy number of encoding nucleic acid in a 
20 sample involves a Southern transfer. In a Southern Blot, the genomic DNA (typically 

fragmented and separated on an electrophoretic gel) is hybridized to a probe specific for the 
target region. Comparison of the intensity of the hybridization signal from the probe for the 
target region with control probe signal from analysis of normal genomic DNA {e.g., a non- 
amplified portion of the same or related cell, tissue, organ, etc.) provides an estimate of the 
25 relative copy number of the target nucleic acid. 

An alternative means for determining the copy number is in situ 
hybridization. In situ hybridization assays are well known {e.g., Angerer (1987) Meth. 
Enzymol 152: 649). Generally, in situ hybridization comprises the following major steps: (1) 
fixation of tissue or biological structure to be analyzed; (2) prehybridization treatment of the 
30 biological structure to increase accessibility of target DNA, and to reduce nonspecific 

binding; (3) hybridization of the mixture of nucleic acids to the nucleic acid in the biological 
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structure or tissue; (4) post-hybridization washes to remove nucleic acid fragments not 
bound in the hybridization and (5) detection of the hybridized nucleic acid fragments. The 
reagent used in each of these steps and the conditions for use vary depending on the 
particular application. 

5 Preferred hybridization-based assays include, but are not limited to, 

traditional "direct probe" methods such as Southern blots or in situ hybridization {e.g., 
FISH), and "comparative probe" methods such as comparative genomic hybridization 
(CGH). The methods can be used in a wide variety of formats including, but not limited to 
substrate- (e.g. membrane or glass) bound methods or array-based approaches as described 
10 below. 

In a typical in situ hybridization assay, cells are fixed to a solid support, 
typically a glass slide. If a nucleic acid is to be probed, the cells are typically denatured with 
heat or alkali. The cells are then contacted with a hybridization solution at a moderate 
temperature to permit annealing of labeled probes specific to the nucleic acid sequence 
15 encoding the protein. The targets (e.g., cells) are then typically washed at a predetermined 
stringency or at an increasing stringency until an appropriate signal to noise ratio is obtained. 

The probes are typically labeled, e.g., with radioisotopes or fluorescent 
reporters. Preferred probes are sufficiently long so as to specifically hybridize with the target 
nucleic acid(s) under stringent conditions. The preferred size range is from about 200 bp to 
20 about 1000 bases. 

In some applications it is necessary to block the hybridization capacity of 
repetitive sequences. Thus, in some embodiments, tRNA, human genomic DNA, or Cot-1 
DNA is used to block non- specific hybridization. 

In comparative genomic hybridization methods a first collection of (sample) 
25 nucleic acids (e.g. from a possible tumor) is labeled with a first label, while a second 

collection of (control) nucleic acids (e.g. from a healthy cell/tissue) is labeled with a second 
label. The ratio of hybridization of the nucleic acids is determined by the ratio of the two 
(first and second) labels binding to each fiber in the array. Where there are chromosomal 
deletions or multiplications, differences in the ratio of the signals from the two labels will be 
30 detected and the ratio will provide a measure of the copy number. 

Hybridization protocols suitable for use with the methods of the invention are 
described, e.g., in Albertson (1984) EMBO J. 3: 1227-1234; Pinkel (1988) Proc. Natl. Acad. 
Sci. USA 85: 9138-9142; EPO Pub. No. 430,402; Methods in Molecular Biology, Vol 33: In 



Situ Hybridization Protocols, Choo, ed., Humana Press, Totowa, NJ (1994), etc. In one 
particularly preferred embodiment, the hybridization protocol of Pinkel et ah (1998) Nature 
Genetics 20: 207-21 1, or of Kallioniemi (1992) Proa Natl Acad Sci USA 89:5321-5325 
(1992) is used. 

5 2) Amplification-based assays. 

In still another embodiment, amplification-based assays can be used to 
measure copy number. In such amplification-based assays, the nucleic acid sequences act as 
a template in an amplification reaction (e.g. Polymerase Chain Reaction (PCR). In a 
quantitative amplification, the amount of amplification product will be proportional to the 

10 amount of template in the original sample. Comparison to appropriate (e.g. healthy tissue) 
controls provides a measure of the copy number. 

Methods of "quantitative" amplification are well known to those of skill in the 
art. For example, quantitative PCR involves simultaneously co-amplifying a known quantity 
of a control sequence using the same primers. This provides an internal standard that may be 

15 used to calibrate the PCR reaction. Detailed protocols for quantitative PCR are provided in 
Innis et ah (1990) PCR Protocols, A Guide to Methods and Applications, Academic Press, 
Inc. N.Y.). The known nucleic acid sequence for the genes is sufficient to enable one of skill 
to routinely select primers to amplify any portion of the gene. 

Other suitable amplification methods include, but are not limited to ligase 

20 chain reaction (LCR) (see Wu and Wallace (1989) Genomics 4: 560, Landegren et ah (1988) 
Science 241 : 1077, and Barringer et ah (1990) Gene 89: 117, transcription amplification 
(Kwoh et ah (1989) Proc. Natl. Acad. Sci. USA 86: 1 173), self-sustained sequence 
replication (Guatelli et ah (1990) Proc. Nat. Acad. Set USA 87: 1874), dot PCR, and linker 
adapter PCR, etc. 

25 B) Detection of gene expression 

As indicated above, gene level can also be assayed as a marker for a 
predilection to cancer. In preferred embodiments, activity of a particular gene is 
characterized by a measure of gene transcript (e.g. mRNA), by a measure of the quantity of 
translated protein, or by a measure of gene product activity. 
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1) Detection of gene transcript. 



a) Direct hybridization based assays. 

Methods of detecting and/or quantifying the gene transcript (mRNA or cDNA 
made therefrom) using nucleic acid hybridization techniques are known to those of skill in 
5 the art (see Sambrook et al. supra). For example, one method for evaluating the presence, 
absence, or quantity of cDNA involves a Southern transfer as described above. Briefly, the 
mRNA is isolated {e.g. using an acid guanidinium-phenol-chloroform extraction method, 
Sambrook et al supra.) and reverse transcribed to produce cDNA. The cDNA is then 
optionally digested and run on a gels in buffer and transferred to membranes. Hybridization 
10 is then carried out using the nucleic acid probes specific for the target cDNA. 

The probes can be full length or less than the full length of the nucleic acid 
sequence encoding the protein. Shorter probes are empirically tested for specificity. 
Preferably nucleic acid probes are 20 bases or longer in length. (See Sambrook et ah for 
methods of selecting nucleic acid probe sequences for use in nucleic acid hybridization.) 
15 Visualization of the hybridized portions allows the qualitative determination of the presence 
or absence of cDNA. 

Similarly, a Northern transfer may be used for the detection of mRNA 
directly. In brief, the mRNA is isolated from a given cell sample using, for example, an acid 
guanidinium-phenol-chloroform extraction method. The mRNA is then electrophoresed to 
20 separate the mRNA species and the mRNA is transferred from the gel to a nitrocellulose 
membrane. As with the Southern blots, labeled probes are used to identify and/or quantify 
the mRNA. 

Amplification-based assays. 

In another preferred embodiment, a transcript {e.g., mRNA) can be measured 
25 using amplification {e.g. PCR) based methods as described above for directly assessing copy 
number of DNA. In a preferred embodiment, transcript level is assessed by using reverse 
transcription PCR (RT-PCR). 

As indicated above, PCR assay methods are well known to those of skill in 
the art. Similarly, RT-PCR methods are also well known. Moreover, probes for such an 
30 RT-PCR assay are provided below in Figure 1 and the assay is illustrated in Example 1 {see, 
e.g., Figure 3). 



2) Detection of expressed protein 

The "activity" of a gene can also be detected and/or quantified by detecting or 
quantifying the expressed polypeptide. The polypeptide can be detected and quantified by 
any of a number of means well known to those of skill in the art. These may include 
5 analytic biochemical methods such as electrophoresis, capillary electrophoresis, high 
performance liquid chromatography (HPLC), thin layer chromatography (TLC), 
hyperdiffiision chromatography, and the like, or various immunological methods such as 
fluid or gel precipitin reactions, immunodiffusion (single or double), 
Immunoelectrophoresis, radioimmunoassay (RIA), enzyme-linked immunosorbent assays 

10 (ELISAs), immuno fluorescent assays, western blotting, and the like. 

In one preferred embodiment, the polypeptide is detected/quantified in an 
electrophoretic protein separation (e.g. a 1- or 2-dimensional electrophoresis). Means of 
detecting proteins using electrophoretic techniques are well known to those of skill in the art 
(see generally, R. Scopes (1982) Protein Purification, Springer- Verlag, N.Y.; Deutscher, 

15 (1990) Methods in Enzymology Vol. 182: Guide to Protein Purification, Academic Press, 
Inc., N.Y.). 

In another preferred embodiment, Western blot (immunoblot) analysis is used 
to detect and quantify the presence of a polypeptide in the sample. This technique generally 
comprises separating sample proteins by gel electrophoresis on the basis of molecular 

20 weight, transferring the separated proteins to a suitable solid support, (such as a 

nitrocellulose filter, a nylon filter, or derivatized nylon filter), and incubating the sample 
with the antibodies that specifically bind a polypeptide. The anti-polypeptide antibodies 
specifically bind to the polypeptide on the solid support. These antibodies may be directly 
labeled or alternatively may be subsequently detected using labeled antibodies (e.g., labeled 

25 sheep anti-mouse antibodies) that specifically bind to the anti-polypeptide. 

In a more preferred embodiment, the polypeptide is detected using an 
immunoassay. As used herein, an immunoassay is an assay that utilizes an antibody to 
specifically bind to the analyte. The immunoassay is thus characterized by detection of 
specific binding of a polypeptide to an anti-antibody as opposed to the use of other physical 

30 or chemical properties to isolate, target, and quantify the analyte. 

The polypeptide is detected and/or quantified using any of a number of well 
recognized immunological binding assays (see, e.g., U.S. Patents 4,366,241; 4,376,110; 
4,517,288; and 4,837,168). For a review of the general immunoassays, see also Asai (1993) 



Methods in Cell Biology Volume 37: Antibodies in Cell Biology, Academic Press, Inc. New 
York; Stites & Terr (1991) Basic and Clinical Immunology 7th Edition. 

Immunological binding assays (or immunoassays) typically utilize a "capture 
agent" to specifically bind to and often immobilize the analyte (polypeptide or 
5 subsequence). The capture agent is a moiety that specifically binds to the analyte. In a 

preferred embodiment, the capture agent is an antibody that specifically binds a polypeptide. 
The antibody (anti-peptide) may be produced by any of a number of means well known to 
those of skill in the art. 

Immunoassays also often utilize a labeling agent to specifically bind to and 

10 label the binding complex formed by the capture agent and the analyte. The labeling agent 
may itself be one of the moieties comprising the antibody/analyte complex. Thus, the 
labeling agent may be a labeled polypeptide or a labeled anti-antibody. Alternatively, the 
labeling agent may be a third moiety, such as another antibody, that specifically binds to the 
antibody/polypeptide complex. 

15 In one preferred embodiment, the labeling agent is a second human antibody 

bearing a label. Alternatively, the second antibody may lack a label, but it may, in turn, be 
bound by a labeled third antibody specific to antibodies of the species from which the second 
antibody is derived. The second can be modified with a detectable moiety, e.g., as biotin, to 
which a third labeled molecule can specifically bind, such as enzyme-labeled streptavidin. 

20 Other proteins capable of specifically binding immunoglobulin constant 

regions, such as protein A or protein G may also be used as the label agent. These proteins 
are normal constituents of the cell walls of streptococcal bacteria. They exhibit a strong non- 
immunogenic reactivity with immunoglobulin constant regions from a variety of species 
{see, generally Kronval, et al (1973) J. Immunol, 111: 1401-1406, and Akerstrom (1985) J. 

25 Immunol, 135: 2589-2542). 

As indicated above, immunoassays for the detection and/or quantification of a 
polypeptide can take a wide variety of formats well known to those of skill in the art. 

Preferred immunoassays for detecting a polypeptide are either competitive or 
noncompetitive. Noncompetitive immunoassays are assays in which the amount of captured 

30 analyte is directly measured. In one preferred "sandwich" assay, for example, the capture 
agent (anti-peptide antibodies) can be bound directly to a solid substrate where they are 
immobilized. These immobilized antibodies then capture polypeptide present in the test 
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sample. The polypeptide thus immobilized is then bound by a labeling agent, such as a 
second human antibody bearing a label. 

In competitive assays, the amount of analyte (polypeptide) present in the 
sample is measured indirectly by measuring the amount of an added (exogenous) analyte 
(polypeptide) displaced (or competed away) from a capture agent (anti peptide antibody) by 
the analyte present in the sample. In one competitive assay, a known amount of, in this case, 
a polypeptide is added to the sample and the sample is then contacted with a capture agent. 
The amount of polypeptide bound to the antibody is inversely proportional to the 
concentration of polypeptide present in the sample. 

In one particularly preferred embodiment, the antibody is immobilized on a 
solid substrate. The amount of polypeptide bound to the antibody may be determined either 
by measuring the amount of polypeptide present in an polypeptide /antibody complex, or 
alternatively by measuring the amount of remaining uncomplexed polypeptide. The amount 
of polypeptide may be detected by providing a labeled polypeptide. 

The assays of this invention are scored (as positive or negative or quantity of 
polypeptide) according to standard methods well known to those of skill in the art. The 
particular method of scoring will depend on the assay format and choice of label. For 
example, a Western Blot assay can be scored by visualizing the colored product produced by 
the enzymatic label. A clearly visible colored band or spot at the correct molecular weight is 
scored as a positive result, while the absence of a clearly visible spot or band is scored as a 
negative. The intensity of the band or spot can provide a quantitative measure of 
polypeptide. 

Antibodies for use in the various immunoassays described herein, can be 
produced as described below. 

3) Detection of enzyme activity. 

In another embodiment, level (activity) is assayed by measuring the 
enzymatic activity of the gene product. Methods of assaying the activity of this enzyme are 
well known to those of skill in the art. 
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O Hybridization Formats and Optimization of hybridization conditions. 
1) Array-based hybridization formats. 

The methods of this invention are particularly well suited to array-based 
hybridization formats. For a description of one preferred array-based hybridization system 
5 see Pinkel et al (1998) Nature Genetics, 20: 207-21 1 . 

Arrays are a multiplicity of different "probe" or "target" nucleic acids (or 
other compounds) attached to one or more surfaces {e.g., solid, membrane, or gel). In a 
preferred embodiment, the multiplicity of nucleic acids (or other moieties) is attached to a 
single contiguous surface or to a multiplicity of surfaces juxtaposed to each other. 
10 In an array format a large number of different hybridization reactions can be 

run essentially "in parallel." This provides rapid, essentially simultaneous, evaluation of a 
number of hybridizations in a single "experiment". Methods of performing hybridization 
reactions in array based formats are well known to those of skill in the art (see, e.g., Pastinen 
(1 997) Genome Res. 7: 606-614; Jackson (1996) Nature Biotechnology 14:1685; Chee 
15 (1 995) Science 274: 610; WO 96/17958, Pinkel et al (1998) Nature Genetics 20: 207-21 1). 

Arrays, particularly nucleic acid arrays can be produced according to a wide 
variety of methods well known to those of skill in the art. For example, in a simple 
embodiment, "low density" arrays can simply be produced by spotting (e.g. by hand using a 
pipette) different nucleic acids at different locations on a solid support (e.g. a glass surface, a 
20 membrane, etc. ) . 

This simple spotting, approach has been automated to produce high density 
spotted arrays (see, e.g., U.S. Patent No: 5,807,522). This patent describes the use of an 
automated system that taps a microcapillary against a surface to deposit a small volume of a 
biological sample. The process is repeated to generate high density arrays. 
25 Arrays can also be produced using oligonucleotide synthesis technology. 

Thus, for example, U.S. Patent No. 5,143,854 and PCT Patent Publication Nos. WO 
90/15070 and 92/10092 teach the use of light-directed combinatorial synthesis of high 
density oligonucleotide arrays. 

In brief, the light-directed combinatorial synthesis of oligonucleotide arrays 
30 on glass surfaces proceeds using automated phosphoramidite chemistry and chip masking 
techniques. In one specific implementation, a glass surface is derivatized with a silane 
reagent containing a functional group, e.g., a, hydroxyl or amine group blocked by a 
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photolabile protecting group. Photolysis through a photolithogaphic mask is used selectively 
to expose functional groups which are then ready to react with incoming S'-photoprotected 
nucleoside phosphoramidites. The phosphoramidites react only with those sites which are 
illuminated (and thus exposed by removal of the photolabile blocking group). Thus, the 
5 phosphoramidites only add to those areas selectively exposed from the preceding step. 
These steps are repeated until the desired array of sequences have been synthesized on the 
solid surface. Combinatorial synthesis of different oligonucleotide analogues at different 
locations on the array is determined by the pattern of illumination during synthesis and the 
order of addition of coupling reagents. 
10 In a preferred embodiment, the arrays used in this invention can comprise 

either probe or target nucleic acids. These probes or target nucleic acids are then hybridized 
respectively with their "target" nucleic acids. Because the gene sequences are disclosed 
herein, oligonucleotide arrays can be synthesized containing one or multiple probes specific 
to the subject genes. 

15 In another embodiment the array, particularly a spotted array, can include 

genomic DNA, e.g. overlapping clones that provide a high resolution scan of the amplicon 
containing the subject gene, or of the gene itself Amplicon nucleic acid can be obtained 
from, e.g., HACs, MACs, YACs, BACs, PACs, Pis, cosmids, plasmids, inter- Alu PCR 
products of genomic clones, restriction digests of genomic clones, cDNA clones, 

20 amplification (e.g., PCR) products, and the like. 

In various embodiments, the array nucleic acids are derived from previously 
mapped libraries of clones spanning or including the amplicon sequences of the invention, as 
well as clones from other areas of the genome, as described below. The arrays can be 
hybridized with a single population of sample nucleic acid or can be used with two 

25 differentially labeled collections (as with an test sample and a reference sample). 

Many methods for immobilizing nucleic acids on a variety of solid surfaces 
are known in the art. A wide variety of organic and inorganic polymers, as well as other 
materials, both natural and synthetic, can be employed as the material for the solid surface. 
Illustrative solid surfaces include, e.g., nitrocellulose, nylon, glass, quartz, diazotized 

30 membranes (paper or nylon), silicones, polyformaldehyde, cellulose, and cellulose acetate. 
In addition, plastics such as polyethylene, polypropylene, polystyrene, and the like can be 
used. Other materials which may be employed include paper, ceramics, metals, metalloids, 
semiconductive materials, cermets or the like. In addition, substances that form gels can be 
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used. Such materials include, e.g. , proteins (e.g., gelatins), lipopolysaccharides, silicates, 
agarose and polyacrylamides. Where the solid surface is porous, various pore sizes may be 
employed depending upon the nature of the system. 

In preparing the surface, a plurality of different materials may be employed, 
5 particularly as laminates, to obtain various properties. For example, proteins (e.g., bovine 
serum albumin) or mixtures of macromolecules (e.g., Denhardt's solution) can be employed 
to avoid non-specific binding, simplify covalent conjugation, enhance signal detection or the 
like. If covalent bonding between a compound and the surface is desired, the surface will 
usually be polyfunctional or be capable of being polyfunctionalized. Functional groups 

10 which may be present on the surface and used for linking can include carboxylic acids, 

aldehydes, amino groups, cyano groups, ethylenic groups, hydroxyl groups, mercapto groups 
and the like. The manner of linking a wide variety of compounds to various surfaces is well 
known and is amply illustrated in the literature. 

For example, methods for immobilizing nucleic acids by introduction of 

15 various functional groups to the molecules is known (see, e.g., Bischoff (1987) Anal. 
Biochem., 164: 336-344; Kremsky (1987) Nucl. Acids Res. 15:2891-2910). Modified 
nucleotides can be placed on the target using PCR primers containing the modified 
nucleotide, or by enzymatic end labeling with modified nucleotides. Use of glass or 
membrane supports (e.g., nitrocellulose, nylon, polypropylene) for the nucleic acid arrays of 

20 the invention is advantageous because of well developed technology employing manual and 
robotic methods of arraying targets at relatively high element densities. Such membranes 
are generally available and protocols and equipment for hybridization to membranes is well 
known. 

Target elements of various sizes, ranging from 1 mm diameter down to 1 jam 
25 can be used. Smaller target elements containing low amounts of concentrated, fixed probe 
DNA are used for high complexity comparative hybridizations since the total amount of 
sample available for binding to each target element will be limited. Thus it is advantageous 
to have small array target elements that contain a small amount of concentrated probe DNA 
so that the signal that is obtained is highly localized and bright. Such small array target 
30 elements are typically used in arrays with densities greater than 10 4 /cm 2 . Relatively simple 
approaches capable of quantitative fluorescent imaging of 1 cm 2 areas have been described 
that permit acquisition of data from a large number of target elements in a single image (see, 
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e.g., Wittrup (1994) Cytometry 16:206-213, Pinkel et al (1998) Nature Genetics 20: 207- 
211). 

Arrays on solid surface substrates with much lower fluorescence than 
membranes, such as glass, quartz, or small beads, can achieve much better sensitivity. 
5 Substrates such as glass or fused silica are advantageous in that they provide a very low 
fluorescence substrate, and a highly efficient hybridization environment. Covalent 
attachment of the target nucleic acids to glass or synthetic fused silica can be accomplished 
according to a number of known techniques (described above). Nucleic acids can be 
conveniently coupled to glass using commercially available reagents. For instance, materials 

1 0 for preparation of silanized glass with a number of functional groups are commercially 
available or can be prepared using standard techniques (see, e.g., Gait (1984) 
Oligonucleotide Synthesis: A Practical Approach, IRL Press, Wash., D.C.). Quartz cover 
slips, which have at least 10-fold lower autofluorescence than glass, can also be silanized. 

Alternatively, probes can also be immobilized on commercially available 

15 coated beads or other surfaces. For instance, biotin end-labeled nucleic acids can be bound 
to commercially available avidin-coated beads. Streptavidin or anti-digoxigenin antibody 
can also be attached to silanized glass slides by protein-mediated coupling using e.g., protein 
A following standard protocols {see, e.g., Smith (1992) Science 258: 1 122-1 126). Biotin or 
digoxigenin end-labeled nucleic acids can be prepared according to standard techniques. 

20 Hybridization to nucleic acids attached to beads is accomplished by suspending them in the 
hybridization mix, and then depositing them on the glass substrate for analysis after washing. 
Alternatively, paramagnetic particles, such as ferric oxide particles, with or without avidin 
coating, can be used. 

In one particularly preferred embodiment, probe nucleic acid is spotted onto a 

25 surface (e.g., a glass or quartz surface). The nucleic acid is dissolved in a mixture of water, 
dimethylsulfoxide (DMSO), and nitrocellulose and spotted onto amino-silane coated glass 
slides. Small capillaries tubes can be used to "spot" the probe mixture. 

2) Other hybridization formats. 

A variety of nucleic acid hybridization formats are known to those skilled in 
30 the art. For example, common formats include sandwich assays and competition or 
displacement assays. Hybridization techniques are generally described in Hames and 
Higgins (1985) Nucleic Acid Hybridization, A Practical Approach, IRL Press; Gall and 



Pardue (1969) Proa Natl Acad. Sci. USA 63: 378-383; and John et al (1969) Nature 223: 
582-587. 

Sandwich assays are commercially useful hybridization assays for detecting 
or isolating nucleic acid sequences. Such assays utilize a "capture" nucleic acid covalently 
5 immobilized to a solid support and a labeled "signal" nucleic acid in solution. The sample 
will provide the target nucleic acid. The "capture" nucleic acid and "signal" nucleic acid 
probe hybridize with the target nucleic acid to form a "sandwich" hybridization complex. To 
be most effective, the signal nucleic acid should not hybridize with the capture nucleic acid. 
Typically, labeled signal nucleic acids are used to detect hybridization. 

10 Complementary nucleic acids or signal nucleic acids may be labeled by any one of several 
methods typically used to detect the presence of hybridized polynucleotides. The most 
common method of detection is the use of autoradiography with 3 H, 125 I, 35 S, 14 C, or 32 P- 
labelled probes or the like. Other labels include ligands that bind to labeled antibodies, 
fluorophores, chemi-luminescent agents, enzymes, and antibodies which can serve as 

15 specific binding pair members for a labeled ligand. 

Detection of a hybridization complex may require the binding of a signal 
generating complex to a duplex of target and probe polynucleotides or nucleic acids. 
Typically, such binding occurs through ligand and anti-ligand interactions as between a 
ligand-conjugated probe and an anti-ligand conjugated with a signal. 

20 The sensitivity of the hybridization assays may be enhanced through use of a 

nucleic acid amplification system that multiplies the target nucleic acid being detected. 
Examples of such systems include the polymerase chain reaction (PCR) system and the 
ligase chain reaction (LCR) system. Other methods recently described in the art are the 
nucleic acid sequence based amplification (NASBAO, Cangene, Mississauga, Ontario) and 

25 Q Beta Replicase systems. 

3^ Optimization of hybridization conditions. 

Nucleic acid hybridization simply involves providing a denatured probe and 
target nucleic acid under conditions where the probe and its complementary target can form 
stable hybrid duplexes through complementary base pairing. The nucleic acids that do not 
30 form hybrid duplexes are then washed away leaving the hybridized nucleic acids to be 
detected, typically through detection of an attached detectable label. It is generally 
recognized that nucleic acids are denatured by increasing the temperature or decreasing the 
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salt concentration of the buffer containing the nucleic acids, or in the addition of chemical 
agents, or the raising of the pH. Under low stringency conditions (e.g., low temperature 
and/or high salt and/or high target concentration) hybrid duplexes (e.g., DNArDNA, 
RNA:RNA, or RNA:DNA) will form even where the annealed sequences are not perfectly 
5 complementary. Thus specificity of hybridization is reduced at lower stringency. 
Conversely, at higher stringency (e.g., higher temperature or lower salt) successful 
hybridization requires fewer mismatches. 

One of skill in the art will appreciate that hybridization conditions may be 
selected to provide any degree of stringency. In a preferred embodiment, hybridization is 

10 performed at low stringency to ensure hybridization and then subsequent washes are 
performed at higher stringency to eliminate mismatched hybrid duplexes. Successive 
washes may be performed at increasingly higher stringency (e.g., down to as low as 0.25 X 
SSPE at 37°C to 70°C) until a desired level of hybridization specificity is obtained. 
Stringency can also be increased by addition of agents such as formamide. Hybridization 

1 5 specificity may be evaluated by comparison of hybridization to the test probes with 
hybridization to the various controls that can be present. 

In general, there is a tradeoff between hybridization specificity (stringency) 
and signal intensity. Thus, in a preferred embodiment, the wash is performed at the highest 
stringency that produces consistent results and that provides a signal intensity greater than 

20 approximately 10% of the background intensity. Thus, in a preferred embodiment, the 
hybridized array may be washed at successively higher stringency solutions and read 
between each wash. Analysis of the data sets thus produced will reveal a wash stringency 
above which the hybridization pattern is not appreciably altered and which provides 
adequate signal for the particular probes of interest. 

25 In a preferred embodiment, background signal is reduced by the use of a 

detergent (e.g., C-TAB) or a blocking reagent (e.g., tRNA, sperm DNA, cot-1 DNA, etc.) 
during the hybridization to reduce non-specific binding. In a particularly preferred 
embodiment, the hybridization is performed in the presence of about 10 jj-g/1 jaL tRNA. The 
use of blocking agents in hybridization is well known to those of skill in the art (see, e.g., 

30 Chapter 8 in P. Tijssen, supra.) 

Methods of optimizing hybridization conditions are well known to those of 
skill in the art (see, e.g., Tijssen (1993) Laboratory Techniques in Biochemistry and 
Molecular Biology, Vol 24: Hybridization With Nucleic Acid Probes, Elsevier, N.Y.). 



Optimal conditions are also a function of the sensitivity of label (e.g., 
fluorescence) detection for different combinations of substrate type, fluorochrome, excitation 
and emission bands, spot size and the like. Low fluorescence background surfaces can be 
used (see, e.g., Chu (1992) Electrophoresis 13:105-114). The sensitivity for detection of 
5 spots ("target elements") of various diameters on the candidate surfaces can be readily 
determined by, e.g., spotting a dilution series of fluorescently end labeled DNA fragments. 
These spots are then imaged using conventional fluorescence microscopy. The sensitivity, 
linearity, and dynamic range achievable from the various combinations of fluorochrome and 
solid surfaces (e.g., glass, fused silica, etc.) can thus be determined. Serial dilutions of pairs 
10 of fluorochrome in known relative proportions can also be analyzed. This determines the 

accuracy with which fluorescence ratio measurements reflect actual fluorochrome ratios over 
the dynamic range permitted by the detectors and fluorescence of the substrate upon which 
the probe has been fixed. 

4^ Labeling and detection of nucleic acids. 

15 In a preferred embodiment, the hybridized nucleic acids are detected by 

detecting one or more labels attached to the sample nucleic acids. The labels may be 
incorporated by any of a number of means well known to those of skill in the art. Means of 
attaching labels to nucleic acids include, for example nick translation, or end-labeling by 
kinasing of the nucleic acid and subsequent attachment (ligation) of a linker joining the 

20 sample nucleic acid to a label (e.g. , a fluorophore). A wide variety of linkers for the 

attachment of labels to nucleic acids are also known. In addition, intercalating dyes and 
fluorescent nucleotides can also be used. 

Detectable labels suitable for use in the present invention include any 
composition detectable by spectroscopic, photochemical, biochemical, immunochemical, 

25 electrical, optical or chemical means. Useful labels in the present invention include biotin 
for staining with labeled streptavidin conjugate, magnetic beads (e.g., Dynabeads™), 
fluorescent dyes (e.g., fluorescein, texas red, rhodamine, green fluorescent protein, and the 
like, see, e.g., Molecular Probes, Eugene, Oregon, USA), radiolabels (e.g., 3 H, 125 1, 35 S, 14 C, 
or 32 P), enzymes (e.g., horse radish peroxidase, alkaline phosphatase and others commonly 

30 used in an ELISA), and colorimetric labels such as colloidal gold (e.g., gold particles in the 
40 -80 nm diameter size range scatter green light with high efficiency) or colored glass or 
plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. Patents teaching the use of such 



labels include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 
4,275,149; and 4,366,241. 

A fluorescent label is preferred because it provides a very strong signal with 
low background. It is also optically detectable at high resolution and sensitivity through a 
quick scanning procedure. The nucleic acid samples can all be labeled with a single label, 
e.g., a single fluorescent label. Alternatively, in another embodiment, different nucleic acid 
samples can be simultaneously hybridized where each nucleic acid sample has a different 
label. For instance, one target could have a green fluorescent label and a second target could 
have a red fluorescent label. The scanning step will distinguish sites of binding of the red 
label from those binding the green fluorescent label. Each nucleic acid sample (target 
nucleic acid) can be analyzed independently from one another. 

Suitable chromogens which can be employed include those molecules and 
compounds which absorb light in a distinctive range of wavelengths so that a color can be 
observed or, alternatively, which emit light when irradiated with radiation of a particular 
wave length or wave length range, e.g., fluorescers. 

Desirably, fluorescers should absorb light above about 300 nm, preferably 
about 350 nm, and more preferably above about 400 nm, usually emitting at wavelengths 
greater than about 10 nm higher than the wavelength of the light absorbed. It should be 
noted that the absorption and emission characteristics of the bound dye can differ from the 
unbound dye. Therefore, when referring to the various wavelength ranges and 
characteristics of the dyes, it is intended to indicate the dyes as employed and not the dye 
which is unconjugated and characterized in an arbitrary solvent. 

Fluorescers are generally preferred because by irradiating a fluorescer with 
light, one can obtain a plurality of emissions. Thus, a single label can provide for a plurality 
of measurable events. 

Detectable signal can also be provided by chemiluminescent and 
bioluminescent sources. Chemiluminescent sources include a compound which becomes 
electronically excited by a chemical reaction and can then emit light which serves as the 
detectable signal or donates energy to a fluorescent acceptor. Alternatively, luciferins can be 
used in conjunction with luciferase or lucigenins to provide bioluminescence. 

Spin labels are provided by reporter molecules with an unpaired electron spin 
which can be detected by electron spin resonance (ESR) spectroscopy. Exemplary spin 
labels include organic free radicals, transitional metal complexes, particularly vanadium, 
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copper, iron, and manganese, and the like. Exemplary spin labels include nitroxide free 
radicals. 

The label may be added to the target (sample) nucleic acid(s) prior to, or after 
the hybridization. So called "direct labels" are detectable labels that are directly attached to 
5 or incorporated into the target (sample) nucleic acid prior to hybridization. In contrast, so 
called "indirect labels" are joined to the hybrid duplex after hybridization. Often, the indirect 
label is attached to a binding moiety that has been attached to the target nucleic acid prior to 
the hybridization. Thus, for example, the target nucleic acid may be biotinylated before the 
hybridization. After hybridization, an avidin-conjugated fluorophore will bind the biotin 

10 bearing hybrid duplexes providing a label that is easily detected. For a detailed review of 
methods of labeling nucleic acids and detecting labeled hybridized nucleic acids see 
Laboratory Techniques in Biochemistry and Molecular Biology, Vol 24: Hybridization 
With Nucleic Acid Probes, P. Tijssen, ed. Elsevier, N.Y., (1993)). 

Fluorescent labels are easily added during an in vitro transcription reaction. 

1 5 Thus, for example, fluorescein labeled UTP and CTP can be incorporated into the RNA 
produced in an in vitro transcription. 

The labels can be attached directly or through a linker moiety. In general, the 
site of label or linker-label attachment is not limited to any specific position. For example, a 
label may be attached to a nucleoside, nucleotide, or analogue thereof at any position that 

20 does not interfere with detection or hybridization as desired. For example, certain Label-ON 
Reagents from Clontech (Palo Alto, CA) provide for labeling interspersed throughout the 
phosphate backbone of an oligonucleotide and for terminal labeling at the 3 ! and 5' ends. As 
shown for example herein, labels can be attached at positions on the ribose ring or the ribose 
can be modified and even eliminated as desired. The base moieties of useful labeling 

25 reagents can include those that are naturally occurring or modified in a manner that does not 
interfere with the purpose to which they are put. Modified bases include but are not limited 
to 7-deaza A and G, 7-deaza-8-aza A and G, and other heterocyclic moieties. 

It will be recognized that fluorescent labels are not to be limited to single 
species organic molecules, but include inorganic molecules, multi-molecular mixtures of 

30 organic and/or inorganic molecules, crystals, heteropolymers, and the like. Thus, for 
example, CdSe-CdS core-shell nanocrystals enclosed in a silica shell can be easily 
derivatized for coupling to a biological molecule (Bruchez et ah (1998) Science, 281 : 2013- 
2016). Similarly, highly fluorescent quantum dots (zinc sulfide-capped cadmium selenide) 
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have been covalently coupled to biomolecules for use in ultrasensitive biological detection 
(Warren andNie (1998) Science, 281: 2016-2018). 



Antibodies 

Either polyclonal or monoclonal antibodies may be used in the immunoassays 
5 and therapeutic methods of the invention described herein. Polyclonal antibodies are 
preferably raised by multiple injections {e.g. subcutaneous or intramuscular injections) of 
substantially pure polypeptides or antigenic polypeptides into a suitable non-human 
mammal. The antigenicity of peptides can be determined by conventional techniques to 
determine the magnitude of the antibody response of an animal that has been immunized 

1 0 with the peptide. Generally, the peptides that are used to raise the anti-peptide antibodies 
should generally be those which induce production of high titers of antibody with relatively 
high affinity for the polypeptide. 

If desired, the immunizing peptide may be coupled to a carrier protein by 
conjugation using techniques which are well-known in the art. Such commonly used carriers 

15 which are chemically coupled to the peptide include keyhole limpet hemocyanin (KLH), 
thyroglobulin, bovine serum albumin (BSA), and tetanus toxoid. The coupled peptide is 
then used to immunize the animal {e.g. a mouse or a rabbit). Because the polypeptide may 
be conserved among mammalian species, use of a carrier protein to enhance the 
immunogenicity of proteins is preferred. 

20 The antibodies are then obtained from blood samples taken from the 

mammal. The techniques used to develop polyclonal antibodies are known in the art (see, 
e.g., Methods ofEnzymology, "Production of Antisera With Small Doses of Immunogen: 
Multiple Intradermal Injections", Langone, et al eds. (Acad. Press, 1981)). Polyclonal 
antibodies produced by the animals can be further purified, for example, by binding to and 

25 elution from a matrix to which the peptide to which the antibodies were raised is bound. 

Those of skill in the art will know of various techniques common in the immunology arts for 
purification and/or concentration of polyclonal antibodies, as well as monoclonal antibodies 
see , for example, Coligan, et al (1991) Unit 9, Current Protocols in Immunology, Wiley 
Interscience). 

30 Preferably, however, the antibodies produced will be monoclonal antibodies 

("mAb's"). For preparation of monoclonal antibodies, immunization of a mouse or rat is 
preferred. The term "antibody" as used in this invention includes intact molecules as well as 
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fragments thereof, such as, Fab and F(ab') 2 which are capable of binding an epitopic 
determinant. Also, in this context, the term "mab f s of the invention" refers to monoclonal 
antibodies with specificity for the subject gene product. 

The general method used for production of hybridomas secreting mAbs is 
5 well known (Kohler and Milstein (1975) Nature, 256:495). Briefly, as described by Kohler 
and Milstein the technique comprised isolating lymphocytes from regional draining lymph 
nodes of five separate cancer patients with either melanoma, teratocarcinoma or cancer of 
the cervix, glioma or lung, (where samples were obtained from surgical specimens), pooling 
the cells, and fusing the cells with SHFP-1. Hybridomas were screened for production of 

10 antibody which bound to cancer cell lines. 

Confirmation of specificity among mAb's can be accomplished using 
relatively routine screening techniques (such as the enzyme-linked immunosorbent assay, or 
"ELISA") to determine the elementary reaction pattern of the mAb of interest. 

It is also possible to evaluate an mAb to determine whether it has the same 

15 specificity as a mAb of the invention without undue experimentation by determining 

whether the mAb being tested prevents a mAb of the invention from binding to the subject 
gene product isolated as described above. If the mAb being tested competes with the mAb 
of the invention, as shown by a decrease in binding by the mAb of the invention, then it is 
likely that the two monoclonal antibodies bind to the same or a closely related epitope. Still 

20 another way to determine whether a mAb has the specificity of a mAb of the invention is to 
preincubate the mAb of the invention with an antigen with which it is normally reactive, and 
determine if the mAb being tested is inhibited in its ability to bind the antigen. If the mAb 
being tested is inhibited then, in all likelihood, it has the same, or a closely related, epitopic 
specificity as the mAb of the invention. 

25 Antibodies fragments, e.g. single chain antibodies (scFv or others), can also 

be produced/selected using phage display technology. The ability to express antibody 
fragments on the surface of viruses that infect bacteria (bacteriophage or phage) makes it 

possible to isolate a single binding antibody fragment from a library of greater than 10^ 
nonbinding clones. To express antibody fragments on the surface of phage (phage display), 
30 an antibody fragment gene is inserted into the gene encoding a phage surface protein (pill) 
and the antibody fragment-pill fusion protein is displayed on the phage surface (McCafferty 
etal (1990) Nature, 348: 552-554; Hoogenboom etal (1991) Nucleic Acids Res. 19: 4133- 
4137). 



Since the antibody fragments on the surface of the phage are functional, 
phage bearing antigen binding antibody fragments can be separated from non-binding phage 
by antigen affinity chromatography (McCafferty et al (1990) Nature, 348: 552-554). 
Depending on the affinity of the antibody fragment, enrichment factors of 20 fold - 
5 1,000,000 fold are obtained for a single round of affinity selection. By infecting bacteria 
with the eluted phage, however, more phage can be grown and subjected to another round of 
selection. In this way, an enrichment of 1000 fold in one round can become 1,000,000 fold 
in two rounds of selection (McCafferty et al (1990) Nature, 348: 552-554). Thus even 
when enrichments are low (Marks et al (1991) J. Mol Biol 222: 581-597), multiple rounds 

10 of affinity selection can lead to the isolation of rare phage. Since selection of the phage 

antibody library on antigen results in enrichment, the majority of clones bind antigen after as 
few as three to four rounds of selection. Thus only a relatively small number of clones 
(several hundred) need to be analyzed for binding to antigen. 

Human antibodies can be produced without prior immunization by displaying 

15 very large and diverse V-gene repertoires on phage (Marks et al (1991) J. Mol Biol 222: 
581-597). In one embodiment natural Vh and Vl repertoires present in human peripheral 
blood lymphocytes are were isolated from unimmunized donors by PCR. The V-gene 
repertoires were spliced together at random using PCR to create a scFv gene repertoire 
which is was cloned into a phage vector to create a library of 30 million phage antibodies 

20 (Id.). From this single "naive" phage antibody library, binding antibody fragments have 

been isolated against more than 17 different antigens, including haptens, polysaccharides and 
proteins (Marks et al (1991) J. Mol Biol 222: 581-597; Marks et al (1993). 
Bio/Technology. 10: 779-783; Griffiths et al (1993) EMBOJ. 12: 725-734; Clackson et al 
(1991) Nature. 352: 624-628). Antibodies have been produced against self proteins, 

25 including human thyroglobulin, immunoglobulin, tumor necrosis factor and CEA (Griffiths 
et al (1993) EMBO J. 12: 725-734). It is also possible to isolate antibodies against cell 
surface antigens by selecting directly on intact cells. The antibody fragments are highly 
specific for the antigen used for selection and have affinities in the 1 :M to 100 nM range 
(Marks et al (1991) J. Mol Biol 222: 581-597; Griffiths et al (1993) EMBO J. 12: 725- 

30 734). Larger phage antibody libraries result in the isolation of more antibodies of higher 
binding affinity to a greater proportion of antigens. 
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It will also be recognized that antibodies can be prepared by any of a number 
of commercial services {e.g., Berkeley antibody laboratories, Bethyl Laboratories, Anawa, 
Eurogenetec, etc.). 



II. Assay Optimization— Determining prognosticallv significant levels. 

5 The assays of this invention have immediate utility in detecting/predicting the 

likelihood of a cancer, in estimating survival from a cancer, in screening for agents that 
modulate the subject gene product activity, and in screening for agents that inhibit cell 
proliferation. In particular, for example, identification of an amplification in a gene of 
Figure 1 (genomic DNA) indicates the presence of a cancer and/or the predisposition to 

10 develop a cancer. Likewise, identification of a deletion in a gene of Figure 2 (genomic 
DNA) indicates the presence of a cancer and/or the predisposition to develop a cancer 

Methods of optimizing predictive/diagnostic assays are well known to those 
of ordinary skill in the art. Typically this involves determining "baseline levels" of gene 
activity in normal tissues and activity levels in pathological {i.e. tumor tissues). In 

15 particularly preferred embodiments, such levels are determined with appropriate controls for 
sample type, age, sex, developmental state, overall physiological state {e.g. in a non- 
pregnant as compared to a pregnant female), overall health, tumor type, etc. In a preferred 
embodiment, "baseline" {e.g., control) levels are determined from a normal (healthy) tissue 
from the same individual or from individuals of the same population. Alternatively, 

20 "baseline" and "pathological" levels are determined from "population" studies" that provide 
sufficient sample size and diversity that the influence of the various co-factors identified 
above (age, health, sex, etc.) can be of statistically evaluated. 

In a preferred embodiment, quantitative assays of gene level are deemed to 
show a positive result, e.g. elevated level, when the measured geme level is greater than the 

25 level measured or known for a control sample {e.g. either a level known or measured for a 
normal healthy mammal of the same species or a "baseline/reference" level determined at a 
different tissue and/or a different time for the same individual. In a particularly preferred 
embodiment, the assay is deemed to show a positive result {e.g., "a prognostically significant 
level") when the difference between sample and "control" is statistically significant {e.g. at 

30 the 85% or greater, preferably at the 90% or greater, more preferably at the 95% or greater 
and most preferably at the 98% or greater confidence level). 
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Ill, Methods of treating cancer — selection of adjuvant therapy 

Because of the ability to evaluate the presence of, or the predisposition to 
develop, a cancer, the assays of this invention make a useful component of a cancer therapy 
regimen. Thus, in one embodiment, gene activity can be used as a measure of disease 
5 progression, while in another embodiment gene activity is used to evaluate the necessity of 
an adjuvant therapy. 

"Adjuvant cancer therapy" refers to a method of treating cancer, such as 
chemotherapy, radiation therapy, surgery, reoperation, antihormone therapy, and 
immunotherapy, that is administered in combination with or following another method of 

10 cancer treatment. An "adjuvant cancer therapy" often represents an aggressive form of 

cancer treatment that is selected in view of a reduced survival expectancy and/or a detected 
level of the gene of interest that is elevated or decreased compared to a control level 
Adjuvant therapies are well known to those of skill in the art and include, but are not limited 
to chemotherapy, radiation therapy, primary surgery or reoperation, antihormone therapy, 

15 immunotherapy, and the like. "Chemotherapy", as used in this context, refers to the 

administration of chemical compounds to an animal with cancer that is aimed at killing or 
reducing the number of cancer cells within the animal. Generally, chemotherapeutic agents 
arrest the growth of or kill cells that are dividing or growing, such as cancer cells. 
Chemotherapeutic agents for use against cancer are well known to those of skill in the art 

20 include, but are not limited to doxirubicin, vinblastine, genistein, etc. 

"Radiation therapy" in this context refers to the administration of 
radioactivity to an animal with cancer. Radiation kills or inhibits the growth of dividing 
cells, such as cancer cells. The administration may be by an external source (e.g., a gamma 
source, a proton source, a molecular beam source, etc.) or may be by an implantable 

25 radioactive material. Radiation therapy includes "traditional" radiation treatment aimed at 
reduction or elimination of tumor volume or more aggressive radio-surgery techniques. 

Surgical methods refer to the direct removal or ablation of cells, e.g. cancer 
cells, from an animal. Most often, the cancer cells will be in the form of a tumor (e.g. a 
mammary tumor), which is removed from the animal. The surgical methods may involve 

30 removal of healthy as well as pathological tissue. "Reoperation" refers to surgery performed 
on an animal that has previously undergone surgery for treatment of the same pathology. 
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"Antihormone therapy" refers to the administration of compounds that 
counteract or inhibit hormones, such as estrogen or androgen, that have a mitogenic effect on 
cells. Often, these hormones act to increase the cancerous properties of cancer cells in vivo. 

Immunotherapy refers to methods of enhancing the ability of an animal's 
5 immune system to destroy cancer cells within the animal. This can involve the treatment 
with polyclonal or monoclonal antibodies that bind particular tumor-specific markers (e.g. 
IL-13 receptor, and Lewis Y (Le Y ) marker, etc.) help to direct cytotoxins of native immune 
system effectors to the tumor target. Immunotherapeutic methods are well know to those of 
skill in the art (see, .e.g, Pastan et a/. (1992) Ann. Rev. Biochem., 61: 331-354, Brinkman and 
10 Pastan (1994) Biochimica Biphysica Acta, 1 198: 27-45, etc.). 

TV. Screening for therapeutics 

It was also a discovery of this invention that downregulation of the activity of 
the genes of Figure 1 is expected to act prophylactically to prevent the development of 
cancers and/or to act therapeutically to reduce or eliminate a cancer. Thus, in one 
15 embodiment, this invention provides methods of screening for agents that modulate and 
preferably that down regulate gene activity. Downregulation, as used in this context, 
includes decrease in transcription and/or decrease in translation, and/or decrease in gene 
product activity. 

Preferred "screening" methods of this invention involve contacting a gene 
20 expressing cell (e.g., a cell capable of expressing a gene of Figure 1 or Figure 2) with a test 
agent; and (ii) detecting the level of gene activity (e.g. as described above), where a 
decreased level of gene activity as compared to the level of gene activity in a cell not 
contacted with the agent indicates that said agent inhibits or downregulates gene and/or 
inhibits proliferation of the cell. 
25 Virtually any agent can be tested in such an assay. Such agents include, but 

are not limited to natural or synthetic nucleic acids, natural or synthetic polypeptides, natural 
or synthetic lipids, natural or synthetic small organic molecules, and the like. In one 
preferred format, test agents are provided as members of a combinatorial library. 
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A) Combinatorial libraries (e.g.. small organic molecules). 

Conventionally, new chemical entities with useful properties are generated by 
identifying a chemical compound (called a "lead compound") with some desirable property 
or activity, creating variants of the lead compound, and evaluating the property and activity 

5 of those variant compounds. However, the current trend is to shorten the time scale for all 
aspects of drug discovery. Because of the ability to test large numbers quickly and 
efficiently, high throughput screening (HTS) methods are replacing conventional lead 
compound identification methods. 

In one preferred embodiment, high throughput screening methods involve 

10 providing a library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays, as described below to identify those library members (particular chemical species or 
subclasses) that display a desired characteristic activity. The compounds thus identified can 
serve as conventional "lead compounds" or can themselves be used as potential or actual 

15 therapeutics. 

A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 
chemical library such as a polypeptide (e.g., mutein) library is formed by combining a set of 

20 chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks. For example, one commentator has observed that the systematic, combinatorial 
mixing of 100 interchangeable chemical building blocks results in the theoretical synthesis 

25 of 100 million tetrameric compounds or 10 billion pentameric compounds (Gallop et al 
(1994) 37(9): 1233-1250). 

Preparation and screening of combinatorial chemical libraries is well known 
to those of skill in the art. Such combinatorial chemical libraries include, but are not limited 
to, peptide libraries (see, e.g., U.S. Patent 5,010,175, Furka (1991) Int. J. Pept. Prot Res., 

30 37: 487-493, Houghton et al (1991) Nature, 354: 84-88). Peptide synthesis is by no means 
the only approach envisioned and intended for use with the present invention. Other 
chemistries for generating chemical diversity libraries can also be used. Such chemistries 
include, but are not limited to: peptoids (PCT Publication No WO 91/19735, 26 Dec. 1991), 



encoded peptides (PCT Publication WO 93/20242, 14 Oct. 1993), random bio-oligomers 
(PCT Publication WO 92/00091, 9 Jan. 1992), benzodiazepines (U.S. Pat. No. 5,288,514), 
diversomers such as hydantoins, benzodiazepines and dipeptides (Hobbs et al, (1993) Proc. 
Nat Acad. Sci. USA 90: 6909-6913), vinylogous polypeptides (Hagihara et al (1992) 
5 Amer. Chem. Soc. 1 14: 6568), nonpeptidal peptidomimetics with a Beta-D-Glucose 

scaffolding (Hirschmann et aL, (1992) J. Amer. Chem. Soc. 114: 9217-9218), analogous 
organic syntheses of small compound libraries (Chen et al. (1994) J. Amer. Chem. Soc. 116: 
2661), oligocarbamates (Cho, et al., (1993) Science 261:1303), and/or peptidyl phosphonates 
(Campbells al, (1994)/. Org. Chem. 59: 658). See, generally, Gordons al, (1994) J. 

10 Med. Chem. 37:1385, nucleic acid libraries {see, e.g., Strategene, Corp.), peptide nucleic acid 
libraries {see, e.g., U.S. Patent 5,539,083) antibody libraries {see, e.g., Vaughn et al (1996) 
Nature Biotechnology, 14(3): 309-314), and PCT/US96/10287), carbohydrate libraries {see, 
e.g., Liang et al (1996) Science, 274: 1520-1522, and U.S. Patent 5,593,853), and small 
organic molecule libraries {see, e.g., benzodiazepines, Baum (1993) C&EN, Jan 18, page 33, 

15 isoprenoids U.S. Patent 5,569,588, thiazolidinones and metathiazanones U.S. Patent 

5,549,974, pyrrolidines U.S. Patents 5,525,735 and 5,519,134, morpholino compounds U.S. 
Patent 5,506,337, benzodiazepines 5,288,514, and the like). 

Devices for the preparation of combinatorial libraries are commercially 
available {see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 

20 Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

A number of well known robotic systems have also been developed for 
solution phase chemistries. These systems include automated workstations like the 
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, 

25 Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, 
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.) which mimic the manual 
synthetic operations performed by a chemist. Any of the above devices are suitable for use 
with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 

30 relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available {see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, Inc., St. 
Louis, MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek 
Biosciences, Columbia, MD, etc.). 



High Throughput Screening 

Any of the assays for compounds modulating gene activity level described 
herein are amenable to high throughput screening. Preferred assays thus detect enhancement 
or inhibition of gene transcription, inhibition or enhancement of polypeptide expression, and 
5 inhibition or enhancement of polypeptide activity. 

High throughput assays for the presence, absence, or quantification of 
particular nucleic acids or protein products are well known to those of skill in the art. 
Similarly, binding assays and reporter gene assays are similarly well known. Thus, for 
example, U.S. Patent 5,559,410 discloses high throughput screening methods for proteins, 

10 U.S. Patent 5,585,639 discloses high throughput screening methods for nucleic acid binding 
{i.e., in arrays), while U.S. Patents 5,576,220 and 5,541,061 disclose high throughput 
methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 
{see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 

15 Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
typically automate entire procedures including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 
as a high degree of flexibility and customization. The manufacturers of such systems 

20 provide detailed protocols for various high throughput systems. Thus, for example, Zymark 
Corp. provides technical bulletins describing screening systems for detecting the modulation 
of gene transcription, ligand binding, and the like. 

V, Reducing gene activity levels in cells 

In another embodiment, this invention provides methods of reducing gene 
25 activity levels in a cell. In this context, a reduction of activity is a decrease in activity as 
compared to the same cell in an "untreated" condition. 

Methods of reducing activity levels of a particular gene or gene product are 
well known to those of skill in the art. Such methods include, but are not limited to targeting 
transcription or translation, e.g. by the use of antisense molecules or ribozymes, by targeting 
30 transciption factors, e.g. with antibodies or DNA binding proteins, and by targeting the 
polypeptide product, e.g. by competition with inactivive binding agents (e.g. muteins), by 
direct blocking, e.g. by binding with antibodies or other ligands, etc. 
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A) Antisense molecules. 

Gene activity can be downregulated, or entirely inhibited, by the use of 
antisense molecules. An "antisense sequence or antisense nucleic acid" is a nucleic acid is 
complementary to the coding mRNA nucleic acid sequence or a subsequence thereof. 
5 Binding of the antisense molecule to the mRNA interferes with normal translation of the 
polypeptide. 

Thus, in accordance with preferred embodiments of this invention, preferred 
antisense molecules include nucleic acids (e.g. oligonucleotides and oligonucleotide analogs) 
that are hybridizable with messenger RNA. This relationship is commonly denominated as 
10 "antisense." The antisense nucleic acids analogs are able to inhibit the function of the RNA, 
either its translation into protein, its translocation into the cytoplasm, or any other activity 
necessary to its overall biological function. The failure of the messenger RNA to perform all 
or part of its function results in a reduction or complete inhibition of expression of 
polypeptides. 

15 In the context of this invention, the term "oligonucleotide" refers to a 

polynucleotide formed from naturally-occurring bases and/or cyclofuranosyl groups joined 
by native phosphodiester bonds. This term effectively refers to naturally-occurring species 
or synthetic species formed from naturally-occurring subunits or their close homologs. The 
term "oligonucleotide" may also refer to moieties which function similarly to 

20 oligonucleotides, but which have non naturally-occurring portions. Thus, oligonucleotides 
may have altered sugar moieties or inter-sugar linkages. Exemplary among these are the 
phosphorothioate and other sulfur containing species which are known for use in the art. In 
accordance with some preferred embodiments, at least one of the phosphodiester bonds of 
the oligonucleotide has been substituted with a structure which functions to enhance the 

25 ability of the compositions to penetrate into the region of cells where the RNA whose 
activity is to be modulated is located. It is preferred that such substitutions comprise 
phosphorothioate bonds, methyl phosphonate bonds, or short chain alkyl or cycloalkyl 
structures. In accordance with other preferred embodiments, the phosphodiester bonds are 
substituted with structures which are, at once, substantially non-ionic and non-chiral, or with 

30 structures which are chiral and enantiomerically specific. Persons of ordinary skill in the art 

will be able to select other linkages for use in the practice of the invention. 

Oligonucleotides may also include species that include at least some modified 

base forms. Thus, purines and pyrimidines other than those normally found in nature may be 
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so employed. Similarly, modifications on the furanosyl portions of the nucleotide subunits 
may also be effected, as long as the essential tenets of this invention are adhered to. 
Examples of such modifications are 2 ? -0-alkyl- and 2'-halogen-substituted nucleotides. 
Some specific examples of modifications at the T position of sugar moieties which are 
5 useful in the present invention are OH, SH, SCH 3 , F, OCH 3 , OCN, 0(CH 2 )[n]NH 2 or 
0(CH 2 )[n]CH 3 , where n is from 1 to about 10 5 and other substituents having similar 
properties. 

Such oligonucleotides are best described as being functionally 
interchangeable with natural oligonucleotides or synthesized oligonucleotides along natural 

10 lines, but which have one or more differences from natural structure. All such analogs are 
comprehended by this invention so long as they function effectively to hybridize with 
messenger RNA to inhibit the function of that RNA. 

The oligonucleotides in accordance with this invention preferably comprise 
from about 3 to about 100 subunits. It is more preferred that such oligonucleotides and 

15 analogs comprise from about 8 to about 25 subunits and still more preferred to have from 
about 12 to about 20 subunits. As will be appreciated, a subunit is a base and sugar 
combination suitably bound to adjacent subunits through phosphodiester or other bonds. The 
oligonucleotides used in accordance with this invention may be conveniently and routinely 
made through the well-known technique of solid phase synthesis. Equipment for such 

20 synthesis is sold by several vendors, including Applied Biosystems. Any other means for 
such synthesis may also be employed, however, the actual synthesis of the oligonucleotides 
is well within the talents of the routineer. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 

Ribozvmes 

25 In addition to antisense molecules, ribozymes can be used to target and inhibit 

transcription of the gene of interest A ribozyme is an RNA molecule that catalytically 
cleaves other RNA molecules. Different kinds of ribozymes have been described, including 
group I ribozymes, hammerhead ribozymes, hairpin ribozymes, RNAse P, and axhead 
ribozymes {see Castanotto et al. (1994) Adv. in Pharmacology 25: 289-317 for a general 

30 review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described e.g., in Hampel et al 
(1990) Nucl Acids Res. 18: 299-304; Hampel et al (1990) European Patent Publication No. 



0 360 257; U.S. Patent No. 5,254,678. Methods of preparing are well known to those of skill 
in the art {see, e.g., Wong-Staal et al, WO 94/26877; Ojwang et al (1993) Proa Natl. Acad. 
ScL USA 90: 6340-6344; Yamada et al (1994) Human Gene Therapy 1 : 39-45; Leavitt et al 
(1995) Proa Natl Acad. ScL USA 92: 699-703; Leavitt et al (1994) Human Gene Therapy 
5 5: 1151-120; and Yamada etal (1994) Virology 205: 121-126). 

C) Competitive inhibition of polypeptide activity. 

Polypeptide activity of the subject gene of Figure 1 can be decreased by 
pro viking a competitive inhibitor of the polypeptide gene product. Methods of making 
inactive polypeptide variants (muteins) are well known to those of skill {see, e.g., U.S. 
10 Patents 5,486,463, 5,422,260, 5,116,943, 4,752,585, 4,518,504). Screening of such 

polypeptides {e.g., in the assays described above) can be accomplished with only routine 
experimentation. Using high-throughput methods, as described herein, literally thousands of 
agents can be screened in only a day or two. 

J» Modification of promoters to regulate endogenous expression. 

1 5 In still another embodiment, the expression of genes can be altered by altering 

the endogenous promoter. Methods of altering expression of endogenous genes are well 
known to those of skill in the art. Typically such methods involve altering or replacing all or 
a portion of the regulatory sequences controlling expression of the particular gene that is to 
be regulated. In a preferred embodiment, the regulatory sequences {e.g., the native 

20 promoter) upstream of the subject gene are altered. 

This is typically accomplished by the use of homologous recombination to 
introduce a heterologous nucleic acid into the native regulatory sequences. To downregulate 
expression of the gene product, simple mutations that either alter the reading frame or 
disrupt the promoter are suitable. To upregulate expression of the gene product, inducible 

25 promoters may be introduced to respond to any number of stimuli and thereby increase gene 
expression. 

In a particularly preferred embodiment, nucleic acid sequences comprising 
the structural gene in question or upstream sequences are utilized for targeting heterologous 
recombination constructs. The use of homologous recombination to alter expression of 
30 endogenous genes is described in detail in U.S. Patent 5,272,071, WO 91/09955, WO 
93/09222, WO 96/29411, WO 95/31560, and WO 91/12650. 
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E) Use of other molecules. 

Numerous other approaches can be taken to downregulate gene activity. As 
indicated above, particularly using high throughput screening methods, literally thousands of 
compounds can be tested for ability to alter (e.g. downregulate) gene activity. Any one or 
5 more of the compounds identified above or in such screening systems can be used to 
modulate activity. 

F) Administration of gene modulators. 

The compounds that modulate (e.g. downregulate) activity can be 
administered by a variety of methods including, but not limited to parenteral, topical, oral, or 

10 local administration, such as by aerosol or transdermally, for prophylactic and/or therapeutic 
treatment. The pharmaceutical compositions can be administered in a variety of unit dosage 
forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include powder, tablets, pills, capsules and lozenges. It is 
recognized that the gene product modulators (e.g. antibodies, antisense constructs, 

15 ribozymes, small organic molecules, etc.) when administered orally, must be protected from 
digestion. This is typically accomplished either by complexing the molecule(s) with a 
composition to render it resistant to acidic and enzymatic hydrolysis or by packaging the 
molecule(s) in an appropriately resistant carrier such as a liposome. Means of protecting 
agents from digestion are well known in the art. 

20 The compositions for administration will commonly comprise a modulator 

dissolved in a pharmaceutically acceptable carrier, preferably an aqueous carrier. A variety 
of aqueous carriers can be used, e.g., buffered saline and the like. These solutions are sterile 
and generally free of undesirable matter. These compositions may be sterilized by 
conventional, well known sterilization techniques. The compositions may contain 

25 pharmaceutically acceptable auxiliary substances as required to approximate physiological 
conditions such as pH adjusting and buffering agents, toxicity adjusting agents and the like, 
for example, sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium 
lactate and the like. The concentration of active agent in these formulations can vary widely, 
and will be selected primarily based on fluid volumes, viscosities, body weight and the like 

30 in accordance with the particular mode of administration selected and the patient's needs. 

Thus, a typical pharmaceutical composition for intravenous administration 
would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per 
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patient per day may be used, particularly when the drug is administered to a secluded site 
and not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 
5 in the art and are described in more detail in such publications as Remington's 

Pharmaceutical Science, 15th ed., Mack Publishing Company, Easton, Pennsylvania (1980). 

The compositions containing modulators can be administered for therapeutic 
or prophylactic treatments. In therapeutic applications, compositions are administered to a 
patient suffering from a disease {e.g., an epithelial cancer) in an amount sufficient to cure or 

10 at least partially arrest the disease and its complications. An amount adequate to accomplish 
this is defined as a "therapeutically effective dose." Amounts effective for this use will 
depend upon the severity of the disease and the general state of the patient's health. Single 
or multiple administrations of the compositions may be administered depending on the 
dosage and frequency as required and tolerated by the patient. In any event, the composition 

1 5 should provide a sufficient quantity of the agents of this invention to effectively treat the 
patient. 

VL Kits for use in diagnostic and/or prognostic applications. 

For use in diagnostic, research, and therapeutic applications suggested above, 
kits are also provided by the invention. In the diagnostic and research applications such kits 

20 may include any or all of the following: assay reagents, buffers,nspecific nucleic acids or 
antibodies {e.g. full-size monoclonal or polyclonal antibodies, single chain antibodies {e.g., 
scFv), or other gene product binding molecules), mid other hybridization probes and/or 
primers, and/or substrates for polypeptide gene products. A therapeutic product may include 
sterile saline or another pharmaceutically acceptable emulsion and suspension base. 

25 In addition, the kits may include instructional materials containing directions 

{i.e., protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 
medium capable of storing such instructions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 

30 media {e.g., magnetic discs, tapes, cartridges, chips), optical media {e.g., CD ROM), and the 
like. Such media may include addresses to internet sites that provide such instructional 
materials. 



EXAMPLES 

The following examples are offered to illustrate, but not to limit the claimed 

invention. 

Example 1 : Genome Wide Screening of Gene Copy Number Changes on the NCI60 Cell 
5 Lines, 

It is understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included within the spirit and purview of 
this application and scope of the appended claims. All publications, patents, and patent 
1 0 applications cited herein are hereby incorporated by reference in their entirety for all 
purposes. 

Genome copy number changes are common in cancer and are presumed to 
play a role in tumor progression and response to therapy. The availability of cell lines 
carrying representative abnormalities makes it possible to study the phenotypic 

15 consequences in a systematic way. To this end, we applied array CGH to analyze genome 
copy abnormalities in the 60 cell lines of the NCI drug-screening panel. -450 loci containing 
cancer-related genes involved in carcinogeneisis, apoptosis, cell cycle regulation and signal 
transduction were analyzed. We studied the biological consequences of gene amplification 
and deletion, and to study the characteristics of chemotherapeutic agents that modify 

20 expression of the genes. We performed such copy analysis on the following cell lines: 

• Breast 

MCF-7, MCF-7/ADR, HS578T, MDA-MB231, MDA-MB435, MDA-N, BT549, 
T47D 

# Ovarian 

25 OVCAR3, OVCAR4, OVCAR5, OVCAR8, IGROV1, SKOV3 

# Lung 

NCI-H23, NCI-H226, NCI-H322M, NCI-H460, NCI-H522, A549, EKVX, HOP62, 
HOP92 

• CNS 

30 SNB19, SNB75, U251, SF268, SF29S, SF539 
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Prostate 

DU145, PC3 

• Colon 

HT29, HCC2998, HCT116, SW620, COLO205, HCT15, KM12 

5 * Renal 

U031, SN12C, A498, CAKI1, RXF393L, ACHN, 786-0, TK10 

* Melanoma 

LOX-IMVI, MALME-3M, SK-MEL2, SK-MEL5, SK-MEL28, UACC62, UACC257, 
M14 

10 * Leukemia 

CCRF-CEM, K562, MOLT4, HL60, RPMI8226, SR 

Array CGH allows detection of single copy number changes and has high 
dynamic range. The spectrum of abnormalities varied considerably across all cell lines 
1 5 including among those derived from the same tumor type. The most frequent copy number 
changes were gains involving MYC, PCK1, PTPN1, CYP24, MYBL2, SPOll andPRKCBPl, 
and losses involving MTS1, AF9, IRF2, BCL2, MLLT2 and FMR2 in 60 cell lines. This 
information can be used to select lines from the NCI 60 cell line panel for studies of the 
biological consequences of specific genomic alterations. 

20 
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